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Foreword 

These lecture notes were translated from the Finnish lecture notes for the TUT course "Graafi- 
teoria". The laborious bulk translation was taken care of by the students Janne Tamminen (TUT) 
and Kung-Chung Lee (visiting from the University of British Columbia). Most of the material 
was then checked by professor Robert Piche. I want to thank the translation team for their effort. 

The notes form the base text for the course "MAT-41196 Graph Theory". They contain 
an introduction to basic concepts and results in graph theory, with a special emphasis put on 
the network-theoretic circuit-cut dualism. In many ways a model was the elegant and careful 
presentation of S WAMY & THULASIRAMAN, especially the older (and better) edition. There are 
of course many modern text-books with similar contents, e.g. the popular GROSS & Yellen. 

One of the usages of graph theory is to give a unified formalism for many very different- 
looking problems. It then suffices to present algorithms in this common formalism. This has 
lead to the birth of a special class of algorithms, the so-called graph algorithms. Half of the 
text of these notes deals with graph algorithms, again putting emphasis on network-theoretic 
methods. Only basic algorithms, applicable to problems of moderate size, are treated here. 
Special classes of algorithms, such as those dealing with sparse large graphs, "small-world" 
graphs, or parallel algorithms will not be treated. In these algorithms, data structure issues have 
a large role, too (see e.g. Skiena). 

The basis of graph theory is in combinatorics, and the role of "graphics" is only in visual- 
izing things. Graph-theoretic applications and models usually involve connections to the "real 
world" on the one hand — often expressed in vivid graphical terms — and the definitional and 
computational methods given by the mathematical combinatoric and linear-algebraic machin- 
ery on the other. For many, this interplay is what makes graph theory so interesting. There is 
a part of graph theory which actually deals with graphical drawing and presentation of graphs, 
briefly touched in Chapter 6, where also simple algorithms are given for planarity testing and 
drawing. The presentation of the matter is quite superficial, a more profound treatment would 
require some rather deep results in topology and curve theory. Chapter 7 contains a brief intro- 
duction to matroids, a nice generalization and substitute for graphs in many ways. 

Proofs of graph-theoretic results and methods are usually not given in a completely rigorous 
combinatoric form, but rather using the possibilities of visualization given by graphical presen- 
tations of graphs. This can lead to situations where the reader may not be completely convinced 
of the validity of proofs and derivations. One of the goals of a course in graph theory must then 
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be to provide the student with the correct "touch" to such seemingly loose methods of proof. 
This is indeed necessary, as a completely rigoristic mathematical presentation is often almost 
unreadable, whereas an excessively slack and lacunar presentation is of course useless. 



Keijo Ruohonen 



Chapter 1 

Definitions and Fundamental Concepts 



1.1 Definitions 

Conceptually, a graph is formed by vertices and edges connecting the vertices. 



Example. 




Formally, a graph is a pair of sets (V, E), where V is the set of vertices and E is the set of 
edges, formed by pairs of vertices. E is a multiset, in other words, its elements can occur more 
than once so that every element has a multiplicity. Often, we label the vertices with letters (for 
example: a, b, c, . . . or t> 1; v 2 , ■ ■ ■ ) or numbers 1,2,... Throughout this lecture material, we will 
label the elements of V in this way. 

Example. (Continuing from the previous example) We label the vertices as follows : 




We have V = {vi, . . . , v 5 } for the vertices and E = {(t>i, v 2 ), (t>2, ^5), (fs, v 5 ), (t> 5 , t> 4 ), (t> 5 , t> 4 )} 
for the edges. 

Similarly, we often label the edges with letters (for example: a,b,c, . . . or ei, e 2 , . . . ) or num- 
bers 1,2,... for simplicity. 
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Remark. The two edges (u, v) and (v, u) are the same. In other words, the pair is not ordered. 
Example. (Continuing from the previous example) We label the edges as follows: 




SoE = {ei,...,e 5 }. 

We have the following terminologies: 

1. The two vertices u and v are end vertices of the edge (u, v). 

2. Edges that have the same end vertices are parallel. 

3. An edge of the form (v, v) is a loop. 

4. A graph is simple if it has no parallel edges or loops. 

5. A graph with no edges (i.e. E is empty) is empty. 

6. A graph with no vertices (i.e. V and E are empty) is a null graph. 

7. A graph with only one vertex is trivial. 

8. Edges are adjacent if they share a common end vertex. 

9. Two vertices u and v are adjacent if they are connected by an edge, in other words, (u, v) 
is an edge. 

10. The degree of the vertex v, written as d(v), is the number of edges with v as an end vertex. 
By convention, we count a loop twice and parallel edges contribute separately. 

11. A pendant vertex is a vertex whose degree is 1 . 

12. An edge that has a pendant vertex as an end vertex is a pendant edge. 

13. An isolated vertex is a vertex whose degree is 0. 
Example. (Continuing from the previous example) 

• v 4 and v 5 are end vertices ofe 5 . 

• e4 and e§ are parallel. 

• e3 is a loop. 

• The graph is not simple. 

• e\ and €2 are adjacent. 
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• v\ and v 2 are adjacent. 

• The degree ofv% is 1 so it is a pendant vertex. 

• t\ is a pendant edge. 

• The degree ofv§ is 5. 

• The degree ofv± is 2. 

• The degree ofv 3 is so it is an isolated vertex. 

In the future, we will label graphs with letters, for example: 

G = (V,E). 

The minimum degree of the vertices in a graph G is denoted 5(G) (= if there is an isolated 
vertex in G). Similarly, we write A(G) as the maximum degree of vertices in G. 

Example. (Continuing from the previous example) 5(G) = and A(G) = 5. 

Remark. In this course, we only consider finite graphs, i.e. V and E are finite sets. 

Since every edge has two end vertices, we get 

Theorem 1.1. The graph G = (V, E), where V = {vi, . . . , v n } and E = {ei, . . . , e m }, satisfies 

n 

d(vi) = 2m. 

8=1 

Corollary. Every graph has an even number of vertices of odd degree. 

Proof. If the vertices v 1 , . . . , have odd degrees and the vertices v k+ i, . . . , v n have even de- 
grees, then (Theorem 1.1) 

d(vi) H h d(v k ) = 2m - d(v k+i ) d(v n ) 

is even. Therefore, k is even. □ 

Example. (Continuing from the previous example) Now the sum of the degrees is 1 + 2 + + 
2 + 5 = 10 = 2-5. There are two vertices of odd degree, namely v\ and v^. 

A simple graph that contains every possible edge between all the vertices is called a complete 
graph. A complete graph with n vertices is denoted as K n . The first four complete graphs are 
given as examples: 

The graph Gi = (Vi, Ei) is a subgraph of G 2 = (V 2 ,E 2 ) if 

1. Vi C V 2 and 

2. Every edge of Gi is also an edge of G 2 . 
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and 




The subgraph ofG = (V, E) induced by the edge set Ex C E is: 

G\ = =def. (Ei), 

where V\ consists of every end vertex of the edges in E\ . 

Example. ( Continuing from above) From the original graph G, the edges e% and e$ induce 
the subgraph 




The subgraph ofG = (V,E) induced by the vertex set V\ C V is: 

Gi = (Vi, Ex) = def . (Vi), 
where E\ consists of every edge between the vertices in V\ . 

Example. ( Continuing from the previous example) From the original graph G, the vertices v\, 
v 3 and v 5 induce the subgraph 



<vi,v 3 ,v 5 >: 




A complete subgraph of G is called a clique of G. 



CHAPTER 1. DEFINITIONS AND FUNDAMENTAL CONCEPTS 



6 



1.2 Walks, Trails, Paths, Circuits, Connectivity, Components 

Remark. There are many different variations of the following terminologies. We will adhere to 
the definitions given here. 

A walk in the graph G = (V, E) is a finite sequence of the form 

which consists of alternating vertices and edges of G. The walk starts at a vertex. Vertices v it _ x 
and Vi t are end vertices of e Jt (t = 1, . . . , k). Vi is the initial vertex and Vi k is the terminal 
vertex, k is the length of the walk. A zero length walk is just a single vertex Vi . It is allowed to 
visit a vertex or go through an edge more than once. A walk is open if v io ^ v ik . Otherwise it 
is closed. 

Example. In the graph 




the walk 

t> 2 , e 7 , t> 5 , e g , t>i, e 8 , t> 5 , e 6 , t>4, e 5 , u 4 , e 5 , f 4 
w open. On the other hand, the walk 

v 4 , e 5 , f 4 , e 3 , w 3 , e 2 , f 2 , e 7 , v 5 , e 6 , w 4 

15 closed. 

A walk is a ?ra?7 if any edge is traversed at most once. Then, the number of times that the 
vertex pair u, v can appear as consecutive vertices in a trail is at most the number of parallel 
edges connecting u and v. 

Example. (Continuing from the previous example) The walk in the graph 

vx, e 8 , v 5 , eg, Vi, e x , v 2 , e 7 , v 5} e 6 , v 4 , e 5 , v 4} e 4 , v 4 

is a trail. 

A trail is a path if any vertex is visited at most once except possibly the initial and terminal 
vertices when they are the same. A closed path is a circuit. For simplicity, we will assume in 
the future that a circuit is not empty, i.e. its length > 1. We identify the paths and circuits with 
the subgraphs induced by their edges. 
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Example. (Continuing from the previous example) The walk 

v 2 ,e 7 ,v 5 ,e 6 ,v 4 ,e 3 ,v 3 

is a path and the walk 

V2,e 7 ,v 5 ,e 6 ,v 4: ,e 3 ,v 3 ,e2,V2 

is a circuit. 

The walk starting at u and ending at v is called an u-v walk, u and v are connected if there 
is a u-v walk in the graph (then there is also a u-v path!). If u and v are connected and v and w 
are connected, then u and w are also connected, i.e. if there is a u-v walk and a v-w walk, then 
there is also a u-w walk. A graph is connected if all the vertices are connected to each other. 
(A trivial graph is connected by convention.) 

Example. The graph 




is not connected. 

The subgraph G 1 (not a null graph) of the graph G is a component of G if 

1. G\ is connected and 

2. Either Gi is trivial (one single isolated vertex of G) or G\ is not trivial and G\ is the 
subgraph induced by those edges of G that have one end vertex in G\. 

Different components of the same graph do not have any common vertices because of the fol- 
lowing theorem. 

Theorem 1.2. If the graph G has a vertex v that is connected to a vertex of the component G\ 
ofG, then v is also a vertex ofG%. 

Proof. If v is connected to vertex v' of G\, then there is a walk in G 

v — v io i e ji ; V h i ■ ■ ■ i V ik-l ' e jk ' V ik ~ V ■ 

Since v' is a vertex of G\, then (condition #2 above) ej k is an edge of G± and v ik _ 1 is a vertex 
of G\. We continue this process and see that v is a vertex of G\. □ 

Example. 



G: 




The components of G are G\, G 2 , G 3 and G4. 
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Theorem 1.3. Every vertex of G belongs to exactly one component of G. Similarly, every edge 
ofG belongs to exactly one component ofG. 

Proof. We choose a vertex v in G. We do the following as many times as possible starting with 

V 1 = {v}: 

(*) If v' is a vertex of G such that v' £ V\ and v' is connected to some vertex of V±, then 

V ± <- V x U {v'}. 

Since there is a finite number of vertices in G, the process stops eventually. The last V\ induces a 
subgraph G\ of G that is the component of G containing v. G\ is connected because its vertices 
are connected to v so they are also connected to each other. Condition #2 holds because we can 
not repeat (*). By Theorem 1.2, v does not belong to any other component. 

The edges of the graph are incident to the end vertices of the components. □ 

Theorem 1.3 divides a graph into distinct components. The proof of the theorem gives an 
algorithm to do that. We have to repeat what we did in the proof as long as we have free 
vertices that do not belong to any component. Every isolated vertex forms its own component. 
A connected graph has only one component, namely, itself. 

A graph G with n vertices, m edges and k components has the rank 

p(G) = n-k. 

The nullity of the graph is 

H(G) = m — n + k. 
We see that p(G) > and p(G) + p(G) = m. In addition, p(G) > because 

Theorem 1.4. p(G) < m 

Proof. We will use the second principle of induction (strong induction) for m. 
Induction Basis : m = 0. The components are trivial and n = k. 
Induction Hypothesis: The theorem is true for m < p. (p > 1) 
Induction Statement : The theorem is true for m = p. 

Induction Statement Proof : We choose a component G\ of G which has at least one edge. 
We label that edge e and the end vertices u and v. We also label G 2 as the subgraph of G and 
Gi, obtained by removing the edge e from G\ (but not the vertices u and v). We label G' as the 
graph obtained by removing the edge e from G (but not the vertices u and v ) and let k' be the 
number of components of G'. We have two cases: 

1. G 2 is connected. Then, k' = k. We use the Induction Hypothesis on G'\ 

n — k = n — k' = p{G') < m — 1 < m. 

2. G 2 is not connected. Then there is only one path between u and v: 

u, e, v 

and no other path. Thus, there are two components in G 2 and k' = k + 1. We use the 
Induction Hypothesis on G'\ 



p(G') = n-k' = n-k-l<m-l. 
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Hence n — k < m. □ 

These kind of combinatorial results have many consequences. For example: 

Theorem 1.5. If G is a connected graph and k > 2 is the maximum path length, then any two 
paths in G with length k share at least one common vertex. 

Proof. We only consider the case where the paths are not circuits (Other cases can be proven in 
a similar way.). Consider two paths of G with length k: 

v ik (path 

and 

v i'o > e j[ > V A > • • • > e j' k » v i' k (P atn P2>- 

Let us consider the counter hypothesis: The paths p 1 and P2 do not share a common vertex. 
Since G is connected, there exists an Vi -Vi> k path. We then find the last vertex on this path 
which is also on p 1 (at least v io is on p{) and we label that vertex v it . We find the first vertex of 
the v it -Vi' k path which is also on p 2 (at least is on p 2 ) and we label that vertex v^. So we get 
a v k- v i' a P ath 

v it ,e j »,...,e j »,v i > > . 

The situation is as follows: 

v k) i e ji ) V ii i ■ ■ ■ i V it 5 e it+i 5 • • • 5 e jfc 5 ^Jfc 

Ji 



e 3" 

v i , o ,e j[ ,v i[ ,...,v i , s ,e j , +i ,...,e fk ,v ik 
From here we get two paths: v io -v ik path and v^-v^ path. The two cases are: 

• t > s: Now the length of the v io -Vi> path is > k + 1. ^/ 1 

• t < s: Now the length of the Vi> Q -v ik path is > A; + 1. yj □ 



A graph is circuitless if it does not have any circuit in it. 

Theorem 1.6. A graph is circuitless exactly when there are no loops and there is at most one 
path between any two given vertices. 

Proof. First let us assume G is circuitless. Then, there are no loops in G. Let us assume the 
counter hypothesis: There are two different paths between distinct vertices u and v in G: 

u = v io , e h , v h ,e j2 ,...,e jk , v ik = v (path p x ) 

and 

u = v i/o , e fi , , ey 2 , . . . , e fi ,v? t = v (path p 2 ) 
(here we have i = i' and ik = where k > I. We choose the smallest index t such that 

Vi t ^ v i' t - 

There is such a t because otherwise 

'From now on, the symbol ^/ means contradiction. If we get a contradiction by proceeding from the assump- 
tions, the hypothesis must be wrong. 
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1. k > £ and v ik = v = = v k (V ) or 

2. k = £ and v io — , . . . , v i( = v^. Then, there would be two parallel edges between two 
consecutive vertices in the path. That would imply the existence of a circuit between two 
vertices in G. 




We choose the smallest index s such that s > t and v is is in the path p 2 (at least v ik is in p 2 ). We 
choose an index r such that r > t and Vy r = v is (it exists because pi is a path). Then, 

Vit-x , e jt , . . . , e js ,v is (= v Vr ) , e# , . . . , e ft , (= ) 

is a circuit, -y/ (Verify the case t = s = r.) 

Let us prove the reverse implication. If the graph does not have any loops and no two distinct 
vertices have two different paths between them, then there is no circuit. For example, if 

v k> 5 e ji 5 v h i e j2i ■ ■ ■ i e jk > v ik = v k) 

is a circuit, then either k = 1 and is a loop (-y/ ), or k > 2 and the two vertices Vi and 
are connected by two distinct paths 

Vi ,e jl ,v il and v h ,e j2 , . . . ,e jk ,v ik = v io (yO- □ 

1.3 Graph Operations 

The complement of the simple graph G = (V, E) is the simple graph G = (V, E), where the 
edges in E are exactly the edges not in G. 

Example. 
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Example. The complement of the complete graph K n is the empty graph with n vertices. 

Obviously, V = G. If the graphs G = (V, E) and G = (V, E') are simple and V C V, then 
the difference graph is G — G' = (V, £"'), where contains those edges from G that are not 
in G (simple graph). 

Example. 




Here are some binary operations between two simple graphs G\ = (Vi,Ei) and G 2 = 
(V 2 ,E 2 ): 

• The union is G\ U G 2 = (Vi U V2, £Jj U E 2 ) (simple graph). 

• The intersection is Gi D G 2 = (Vi D V 2 , -Ei D £' 2 ) (simple graph). 

• The rm,g ^Mm Gi © G 2 is the subgraph of G\ U G 2 induced by the edge set E 1 © E 2 (simple 
graph). Note! The set operation © is the symmetric difference, i.e. 

E 1 ®E 2 = {E 1 - E 2 ) U (E 2 - Ex). 

Since the ring sum is a subgraph induced by an edge set, there are no isolated vertices. All three 
operations are commutative and associative. 

Example. For the graphs 
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we have 



G x u G 2 : 




Gj © G 2 : 




Remark. The operations U, fl and © can also be defined for more general graphs other than 
simple graphs. Naturally, we have to "keep track" of the multiplicity of the edges: 

U : The multiplicity of an edge in G\ U G 2 is the larger of its multiplicities in Gi and G 2 . 

fl : The multiplicity of an edge in G\ fl G2 is the smaller of its multiplicities in G\ and G 2 . 

© : The multiplicity of an edge in G\ © G2 is \rrii — m 2 \, where rrii is its multiplicity in G\ 
and m2 is its multiplicity in G 2 . 

(We assume zero multiplicity for the absence of an edge.) In addition, we can generalize the dif- 
ference operation for all kinds of graphs if we take account of the multiplicity. The multiplicity 
of the edge e in the difference G — G' is 



mi — rri2 



0, if mi < 1^2 



(also known as the proper difference), 



where mi and m 2 are the multiplicities of e in Gi and G 2 , respectively. 

If v is a vertex of the graph G = (V, E), then G — v is the subgraph of G induced by the 
vertex set V — {v}. We call this operation the removal of a vertex. 



Example. (Continuing from the previous example) 



v 2 
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Similarly, if e is an edge of the graph G = (V, E), then G — e is graph (V, E'), where E' is 
obtained by removing e from E. This operation is known as removal of an edge. We remark 
that we are not talking about removing an edge as in Set Theory, because the edge can have 
nonunit multiplicity and we only remove the edge once. 

Example. (Continuing from the previous example) 




If u and v are two distinct vertices of the graph G = (V, E), then we can short-circuit the 
two vertices u and v and obtain the graph (V, E'), where 

V = (V - {u, v}) U {w} (w £ V is the "new" vertex) 

and 

E' — (E — {(t/, u), (v', v)\v'e V}) U {(v', w) | (t/, «)G£or v) e ^} 

U {(w, w) | (m, m) G 22 or (u, u) G E} 

(Recall that the pair of vertices corresponding to an edge is not ordered). Note! We have to 
maintain the multiplicity of the edges. In particular, the edge (u,v) becomes a loop. 

Example. ( Continuing from the previous example) Short-circuit v% and V4 in the graph G\: 




In the graph G = (V, E), contracting the edge e = (u, v) (not a loop) means the operation in 
which we first remove e and then short-circuit u and v. (Contracting a loop simply removes that 
loop.) 

Example. (Continuing from the previous example) We contract the edge e 3 in Gi by first re- 
moving e 3 and then short-circuiting v 2 and v 3. 
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Remark. If we restrict short-circuiting and contracting to simple graphs, then we remove loops 
and all but one of the parallel edges between end vertices from the results. 

1.4 Cuts 

A vertex v of a graph G is a cut vertex or an articulation vertex of G if the graph G — v consists 
of a greater number of components than G. 

Example, v is a cut vertex of the graph below: 




(Note! Generally, the only vertex of a trivial graph is not a cut vertex, neither is an isolated 
vertex.) 

A graph is separable if it is not connected or if there exists at least one cut vertex in the 
graph. Otherwise, the graph is nonseparable. 

Example. The graph G in the previous example is separable. 

Example. The graph below is nonseparable. 
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A block of the graph G is a subgraph G x of G (not a null graph) such that 

• G\ is nonseparable, and 

• if G 2 is any other subgraph of G, then G1UG2 = Gi or Gi U G2 is separable (think about 
that!). 

Example. The graph below is separable: 



cut vertex 




Theorem 1.7. The vertex v is a cut vertex of the connected graph G if and only if there exist 
two vertices u and w in the graph G such that 

(i) v 7^ u, v 7^ w and u 7^ w, but 

(ii) v is on every u-w path. 

Proof First, let us consider the case that v is a cut-vertex of G. Then, G — v is not connected 
and there are at least two components G\ = (Vi, E%) and G 2 = (V2, E 2 ). We choose u G V\ 
and w E V 2 . The u-w path is in G because it is connected. If v is not on this path, then the path 
is also in G — v (-y/ ). The same reasoning can be used for all the u-w paths in G. 

If v is in every u-w path, then the vertices u and w are not connected in G — v. □ 
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Theorem 1.8. A nontrivial simple graph has at least two vertices which are not cut vertices. 

Proof. We will use induction for the graph G with n vertices. 
Induction Basis : The case n = 2 is obviously true. 
Induction Hypothesis: The theorem is true for n < k. (k > 2) 
Induction Statement : The theorem is true for n — k + 1. 

Induction Statement Proof : If there are no cut vertices in G, then it is obvious. Otherwise, 
we consider a cut vertex v of G. Let Gi, . . . , G m be the components of G — v (so m > 2). 
Every component Gi falls into one of the two cases: 

1 . Gi is trivial so the only vertex of Gi is a pendant vertex or an isolated vertex of G but it is 
not a cut vertex of G. 

2. Gi is not trivial. The Induction Hypothesis tells us that there exist two vertices u and w 
in Gi which are not cut vertices of Gi. If v and u (respectively v and to) are not adjacent 
in G, then u (respectively w) is not a cut vertex in G. If both v and u as well as u and w 
are adjacent in G, then u and w can not be cut vertices of G. □ 

A cut set of the connected graph G = (V, E) is an edge set F C E such that 

1 . G — F (remove the edges of F one by one) is not connected, and 

2. G — H is connected whenever H C F. 

Theorem 1.9. If F is a cut set of the connected graph G, then G — F has two components. 

Proof. Let F = {e±, . . . , e^}. The graph G — {e\, . . . , ek-i} is connected (and so is G if k = 1) 
by condition #2. When we remove the edges from the connected graph, we get at most two 
components. □ 

Example. In the graph 




{ei, e 4 }, {e 6 , e 7 }, {e 1 , e 2 , e 3 }, {e 8 }, {e 3 , e 4 , e 5 , e 6 }, {e 2 , e 5 , e 7 }, {e 2 , e 5 , e 6 } and {e 2 , e 3 , e 4 } are 
cut sets. Are there other cut sets? 

In a graph G — (V, E), a pair of subsets Vi and V 2 of V satisfying 

v = v 1 uv 2 , v 1 nv 2 = fH, ^17^0, ^ 27 ^0, 

is called a cm? (or a partition) of G, denoted (Vi, V 2 ). Usually, the cuts (Vi, V2) and (V 2 , Vi) are 
considered to be the same. 
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Example. (Continuing from the previous example) ({vi,V2, V3}, {t>4, V5, vq}) is a cut. 
We can also think of a cut as an edge set: 

cut (Vi, V 2 ) = {those edges with one end vertex in V\ and the other end vertex in V 2 }. 

(Note! This edge set does not define V 1 and V 2 uniquely so we can not use this for the definition 
of a cut.) 

Using the previous definitions and concepts, we can easily prove the following: 

1. The cut (Vi, V 2 ) of a connected graph G (considered as an edge set) is a cut set if and 
only if the subgraphs induced by V% and V 2 are connected, i.e. G — (Vi, V 2 ) has two 
components. 

2. If F is a cut set of the connected graph G and V\ and V 2 are the vertex sets of the two 
components of G — F, then (Vi, V 2 ) is a cut and F = (Vi, V2). 

3. If v is a vertex of a connected (nontrivial) graph G = (V, E), then ({v}, V — {v}) is a cut 
of G. It follows that the cut is a cut set if the subgraph (i.e. G — v ) induced by V — {v} 
is connected, i.e. if v is not a cut vertex. 

If there exists a cut (Vi, V 2 ) for the graph G = (V, E) so that E = (Vi, V 2 ), i.e. the cut 
(considered as an edge set) includes every edge, then the graph G is bipartite. 

Example. The graph 

v 4 
v 5 
v 6 
v 7 

is bipartite. Vi = {v 1, v 2 , v 3 } and V 2 = {t> 4 , v 5 , v 6 , v{\. 

A simple bipartite graph is called a complete bipartite graph if we can not possibly add any 
more edges to the edge set (Vi, V 2 ), i.e. the graph contains exactly all edges that have one end 
vertex in V\ and the other end vertex in V 2 . If there are n vertices in V\ and m vertices in V 2 , we 
denote it as K n>m (cf. complete graph). 
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Example. 




(Usually K n>m and K m ^ n are considered to be the same.) 

1.5 Labeled Graphs and Isomorphism 

By a labeling of the vertices of the graph G = (V, E), we mean a mapping a : V — >■ A, where 
A is called the label set. Similarly, a labeling of the edges is a mapping : E — > £>, where S is 
the label set. Often, these labels are numbers. Then, we call them weights of vertices and edges. 
In a weighted graph, the weight of a path is the sum of the weights of the edges traversed. 

The labeling of the vertices (respectively edges) is injective if distinct vertices (respectively 
edges) have distinct labels. An injective labeling is bijective if there are as many labels in A 
(respectively in B) as the number of vertices (respectively edges). 

Example. If A = {0, 1} and B — R, then in the graph, 




the labeling of the edges (weights) is injective but not the labeling of the vertices. 

The two graphs Gi = (Vi, E{) and G 2 = {V 2 , E 2 ) are isomorphic if labeling the vertices of 
Gi bijectively with the elements of V 2 gives G 2 - (Note! We have to maintain the multiplicity of 
the edges.) 
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Example. The graphs G\ and G2 are isomorphic and the vertex labeling Vi \-t v[ and edge 
labeling ej !->■ e'- define the isomorphism. 




Determining whether or not two graphs are isomorphic is a well researched problem. It 
differs significantly from other problems in graph theory and network analysis. In addition, 
it has a lot to do with group theory in algebra. The problem is important in the theory of 
Computational Complexity. For example, refer to KOBLER, J. & SCHONING, U. & TORAN, 
J.: The Graph Isomorphism Problem. Its Structural Complexity. Birkhauser (1993). 



Chapter 2 
Trees 



2.1 Trees and Forests 

A forest is a circuitless graph. A tree is a connected forest. A subforest is a subgraph of a forest. 
A connected subgraph of a tree is a subtree. Generally speaking, a subforest (respectively 
subtree) of a graph is its subgraph, which is also a forest (respectively tree). 

Example. Four trees which together form a forest: 




A spanning tree of a connected graph is a subtree that includes all the vertices of that graph. If 
T is a spanning tree of the graph G, then 

G-T = def . T* 

is the cospanning tree. 
Example. 




20 



CHAPTER 2. TREES 



21 



cospanning tree 



The edges of a spanning tree are called branches and the edges of the corresponding cospanning 
tree are called links or chords. 

Theorem 2.1. If the graph G has n vertices and m edges, then the following statements are 
equivalent: 

(i) G is a tree. 

(ii) There is exactly one path between any two vertices in G and G has no loops. 

(iii) G is connected and m — n — 1. 

(iv) G is circuitless and m = n — 1. 

(v) G is circuitless and if we add any new edge to G, then we will get one and only one circuit. 

Proof. (i)=Kii): If G is a tree, then it is connected and circuitless. Thus, there are no loops in 
G. There exists a path between any two vertices of G. By Theorem 1.6, we know that there is 
only one such path. 

(ii) =Kiii): G is connected. Let us use induction on m. 
Induction Basis : m = 0, G is trivial and the statement is obvious. 
Induction Hypothesis: m = n — 1 when m <£.(£> 0) 
Induction Statement : m = n — 1 when m — £ + 1. 

Induction Statement Proof : Let e be an edge in G. Then G — e has £ edges. If G — e is 
connected, then there exist two different paths between the end vertices of e so (ii) is false. 
Therefore, G — e has two components Gi and G 2 . Let there be n\ vertices and m 1 edges in G\. 
Similarly, let there be n 2 vertices and m 2 vertices in G 2 - Then, 

n = n\ + n 2 and m = mi + m 2 + 1. 

The Induction Hypothesis states that 

mi = rii — 1 and m 2 = n 2 — 1, 

som = ni+n 2 — 1 = n — 1. 

(iii) =Kiv): Consider the counter hypothesis: There is a circuit in G. Let e be some edge in 
that circuit. Thus, there are n vertices and n — 2 edges in the connected graph G — e. y/ 1 

(iv) =Kv): If G is circuitless, then there is at most one path between any two vertices (The- 
orem 1.6). If G has more than one component, then we will not get a circuit when we draw an 
edge between two different components. By adding edges, we can connect components without 
creating circuits: 




'in a connected graph with n vertices, there are at least n — 1 edges. (Theorem 1.4) 
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If we add fc(> 1) edges, then (because (i)=^(iii)) 

m + k = n — 1 (-y/ because m — n — 1). 

So G is connected. When we add an edge between vertices that are not adjacent, we get only 
one circuit. Otherwise, we can remove an edge from one circuit so that other circuits will not 
be affected and the graph stays connected, in contradiction to (iii)=Kiv). Similarly, if we add a 
parallel edge or a loop, we get exactly one circuit. 

(v)=Ki): Consider the counter hypothesis: G is not a tree, i.e. it is not connected. When we 
add edges as we did previously, we do not create any circuits (see figure), □ 

Since spanning trees are trees, Theorem 2.1 is also true for spanning trees. 

Theorem 2.2. A connected graph has at least one spanning tree. 

Proof. Consider the connected graph G with n vertices and m edges. If m = n — 1, then G 
is a tree. Since G is connected, m > n — 1 (Theorem 1.4). We still have to consider the case 
m > n, where there is a circuit in G. We remove an edge e from that circuit. G — e is now 
connected. We repeat until there are n — 1 edges. Then, we are left with a tree. □ 

Remark. We can get a spanning tree of a connected graph by starting from an arbitrary sub- 
forest M (as we did previously). Since there is no circuit whose edges are all in M, we can 
remove those edges from the circuit which are not in M. 

By Theorem 2.1, the subgraph G\ of G with n vertices is a spanning tree of G (thus G is 
connected) if any three of the following four conditions hold: 

1. G\ has n vertices. 

2. G\ is connected. 

3. G\ has n — 1 edges. 

4. Gi is circuitless. 

Actually, conditions #3 and #4 are enough to guarantee that G\ is a spanning tree. If conditions 
#3 and #4 hold but G\ is not connected, then the components of G\ are trees and the number of 
edges in G\ would be 



number of vertices — number of components < n — 1 (y/ ). 



CHAPTER 2. TREES 



23 



Theorem 2.3. If a tree is not trivial, then there are at least two pendant vertices. 

Proof. If a tree has n(> 2) vertices, then the sum of the degrees is 2(n — 1). If every vertex 
has a degree > 2, then the sum will be > 2n (y/ ). On the other hand, if all but one vertex have 
degree > 2, then the sum would be > 1 + 2{n — 1) = 2n — 1 (-y/ ). (This also follows from 
Theorem 1.8 because a cut vertex of a tree is not a pendant vertex!) □ 

A forest with k components is sometimes called a k-tree. (So a 1-tree is a tree.) 

Example. 



4-tree: 





We use Theorem 2.1 to see that a graph with k components has a spanning k-tree, also known 
as a spanning forest, which has k components. 



2.2 (Fundamental) Circuits and (Fundamental) Cut Sets 

If the branches of the spanning tree T of a connected graph G are bi, . . . , 6 n _i and the corre- 
sponding links of the cospanning tree T* are c x , . . . , c m _„ + i, then there exists one and only one 
circuit Cj in T + q (which is the subgraph of G induced by the branches of T and q) (The- 
orem 2.1). We call this circuit a fundamental circuit. Every spanning tree defines m — n + 1 
fundamental circuits C±, . . . , C m _ n+ i, which together form a fundamental set of circuits. Every 
fundamental circuit has exactly one link which is not in any other fundamental circuit in the 
fundamental set of circuits. Therefore, we can not write any fundamental circuit as a ring sum 
of other fundamental circuits in the same set. In other words, the fundamental set of circuits is 
linearly independent under the ring sum operation. 

Example. 
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The graph T — bi has two components T\ and T 2 . The corresponding vertex sets are V\ and 
V 2 . Then, (Vi, V 2 ) is a cut of G. It is also a cut set of G if we treat it as an edge set because 
G — (V\, V 2 ) has two components (result #1 p. 17). Thus, every branch bi of T has a corre- 
sponding cut set lj. The cut sets Ji, . . . , I n -i are a l so known as fundamental cut sets and they 
form & fundamental set of cut sets. Every fundamental cut set includes exactly one branch of T 
and every branch of T belongs to exactly one fundamental cut set. Therefore, every spanning 
tree defines a unique fundamental set of cut sets for G. 

Example. (Continuing from the previous example) The graph 



G: 




e 3 

has the spanning tree 

T: 



— — 2 

that defines these fundamental cut sets: 

h : {ei, e 2 } b 2 : {e 2 , e 3 , e 4 } 63 : {e 2 , e 4 , e 5 , e 6 } 

64 : {e 2 ,e 4 ,e 5 ,e 7 } 65 : {e 8 } 

Next, we consider some properties of circuits and cut sets: 

(a) Every cut set of a connected graph G includes at least one branch from every spanning 
tree of G. (Counter hypothesis: Some cut set F of G does not include any branches of a 
spanning tree T. Then, T is a subgraph of G — F and G — F is connected, ) 

(b) Every circuit of a connected graph G includes at least one link from every cospanning tree 
of G. (Counter hypothesis: Some circuit C of G does not include any link of a cospanning 
tree T*. Then, T = G -T* has a circuit and T is not a tree, y/ ) 
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Theorem 2.4. The edge set F of the connected graph G is a cut set of G if and only if 

(i) F includes at least one branch from every spanning tree ofG, and 

(ii) ifHcF, then there is a spanning tree none of whose branches is in H. 

Proof. Let us first consider the case where F is a cut set. Then, (i) is true (previous proposition 
(a)). If H C F then G — H is connected and has a spanning tree T. This T is also a spanning 
tree of G. Hence, (ii) is true. 

Let us next consider the case where both (i) and (ii) are true. Then G — F is disconnected. 
If H C F there is a spanning tree T none of whose branches is in H. Thus T is a subgraph of 
G — H and G — H is connected. Hence, F is a cut set. □ 

Similarly: 

Theorem 2.5. The subgraph C of the connected graph G is a circuit if and only if 

(i) C includes at least one link from every cospanning tree of G, and 

(ii) if D is a subgraph of C and D ^ C, then there exists a cospanning tree none of whose 
links is in D. 

Proof. Let us first consider the case where C is a circuit. Then, C includes at least one link 
from every cospanning tree (property (b) above) so (i) is true. If D is a proper subgraph of C, 
it obviously does not contain circuits, i.e. it is a forest. We can then supplement D so that it is 
a spanning tree of G (see remark on p. 22), i.e. some spanning tree T of G includes D and D 
does not include any link of T*. Thus, (ii) is true. 

Now we consider the case where (i) and (ii) are both true. Then, there has to be at least one 
circuit in C because C is otherwise a forest and we can supplement it so that it is a spanning 
tree of G (see remark on p. 22). We take a circuit C' in C. Since (ii) is true, C' ^ C is not 
true, because C' is a circuit and it includes a link from every cospanning tree (see property (b) 
above). Therefore, C = C' is a circuit. □ 

Theorem 2.6. A circuit and a cut set of a connected graph have an even number of common 
edges. 

Proof. We choose a circuit C and a cut set F of the connected graph G. G — F has two 
components G\ = (Vi,£i) and G 2 = (V2, £2). If C is a subgraph of G\ or G 2 , then the 
theorem is obvious because they have no common edges. Let us assume that C and F have 
common edges. We traverse around a circuit by starting at some vertex v of G\. Since we come 
back to v, there has to be an even number of edges of the cut (Vi, V 2 ) in C. □ 

The reader is advised to read the following several times: 

Theorem 2.7. A fundamental circuit corresponding to link c of the cospanning tree T* of a 
connected graph is formed exactly by those branches ofT whose corresponding fundamental 
cut set includes c. 

Proof. There exists a fundamental circuit C that corresponds to link c of T*. The other edges 
b\ , . . . , b k of C are branches of T. We denote Ii as the fundamental cut set that corresponds to 
branch hi. Then, hi is the only branch of T which is in both C and Ii. On the other hand, c is 
the only link of T* in C. By Theorem 2.6, we know that the common edges of C and Ii are b{ 
and c, in other words, c is an edge of I, L . Then, we show that there is no c in the fundamental cut 
sets ifc+i, . . . , I n -i that correspond to the branches b k+1 , . . . , b n _i of T. For instance, if c were 
in Jfc+i, then the fundamental cut set I k+1 and the circuit C would have exactly one common 
edge, {y/ ). So c is only in the fundamental cut sets h, ■ ■ ■ , h- □ 
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The following is the corresponding theorem for fundamental cut sets: 

Theorem 2.8. The fundamental cut set corresponding to branch b of the spanning tree T of a 
connected graph consists exactly of those links ofT* whose corresponding fundamental circuit 
includes b. 

Proof. Let I be a fundamental cut set that corresponds to the branch b of T. Other edges 
ci, . . . , Cfc of / are links of T*. Let Cj denote the fundamental circuit that corresponds to q. 
Then, q is the only link of T* in both / and Cj. On the other hand, b is the only branch of T in 
/. By Theorem 2.6, the common edges of / and C, are b and q, in other words, b is an edge of 
Ci. Then, we show that the fundamental circuits Cfc+i, . . . , C m _ n+ i corresponding to the links 
Cjfc+i, . . . , c m _ n+ i do not include b. For example, if b were in Ct+i, then the fundamental circuit 
Cfc+i and the cut set / would have exactly one common edge ). Hence, the branch b is only 
in fundamental circuits Ci, . . . , Cfc. □ 

From the results, we can see the duality between cut sets and circuits of a graph: The 
theorems for cut sets can generally be converted to dual theorems for circuits and vice versa. 
Usually, we just need to change some of the key terminologies to their duals in the theorems and 
proofs. In particular, we take advantage of this dualism for dealing with matroids (see Chapter 
7). 



Chapter 3 
Directed Graphs 



3.1 Definition 

Intuitively, a directed graph or digraph is formed by vertices connected by directed edges or 
arcs} 

Example. 




Formally, a digraph is a pair (V, E), where V is the vertex set and E is the set of vertex pairs as 
in "usual" graphs. The difference is that now the elements of E are ordered pairs: the arc from 
vertex u to vertex v is written as (u, v) and the other pair (v, u) is the opposite direction arc. We 
also have to keep track of the multiplicity of the arc (direction of a loop is irrelevant). We can 
pretty much use the same notions and results for digraphs from Chapter 1 . However: 

1. Vertex u is the initial vertex and vertex v is the terminal vertex of the arc (u,v). We also 
say that the arc is incident out of u and incident into v. 

2. The out-degree of the vertex v is the number of arcs out of it (denoted d + (v)) and the 
in-degree of v is the number of arcs going into it (denoted d~~(v)). 

3. In the directed walk (trail, path or circuit), 

v it is the initial vertex and v ii _ 1 is the terminal vertex of the arc e Jr . 

4. When we treat the graph (V, E) as a usual undirected graph, it is the underlying undirected 
graph of the digraph G = (V, E), denoted G u . 

'This not a standard terminology. We will however call directed edges arcs in the sequel. 



27 



CHAPTER 3. DIRECTED GRAPHS 



28 



5. Digraph G is connected if G u is connected. The components of G are the directed sub- 
graphs of G that correspond to the components of G u . The vertices of G are connected if 
they are connected in G u . Other notions for undirected graphs can be used for digraphs 
as well by dealing with the underlying undirected graph. 

6. Vertices u and v are strongly connected if there is a directed u-v path and also a directed 
v-u path in G. 

7. Digraph G is strongly connected if every pair of vertices is strongly connected. By con- 
vention, the trivial graph is strongly connected. 

8. A strongly connected component H of the digraph G is a directed subgraph of G (not a 
null graph) such that H is strongly connected, but if we add any vertices or arcs to it, then 
it is not strongly connected anymore. 

Every vertex of the digraph G belongs to one strongly connected component of G (compare to 
Theorem 1.3). However, an arc does not necessarily belong to any strongly connected compo- 
nent of G. 

Example. For the digraph G 




the strongly connected components are ({f i}, 0), ({^2, ^3, ^4}, {e3, e±, e$}), ({vs}, 0) and ({fe}, 0)- 

The condensed graph G c of the digraph G is obtained by contracting all the arcs in every 
strongly connected component. 

Example. (Continuing from the previous example) The condensed graph is 
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3.2 Directed Trees 

A directed graph is quasi-strongly connected if one of the following conditions holds for every 
pair of vertices u and v: 

(i) u — v or 

(ii) there is a directed u-v path in the digraph or 

(iii) there is a directed v-u path in the digraph or 

(iv) there is a vertex w so that there is a directed w-u path and a directed w-v path. 

Example. (Continuing from the previous example) The digraph G is quasi-strongly connected. 

Quasi-strongly connected digraphs are connected but not necessarily strongly connected. 

The vertex v of the digraph G is a root if there is a directed path from v to every other vertex 
ofG. 

Example. (Continuing from the previous example) The digraph G only has one root, v±. 

Theorem 3.1. A digraph has at least one root if and only if it is quasi-strongly connected. 

Proof. If there is a root in the digraph, it follows from the definition that the digraph is quasi- 
strongly connected. 

Let us consider a quasi-strongly connected digraph G and show that it must have at least 
one root. If G is trivial, then it is obvious. Otherwise, consider the vertex set V = {vi, . . . , v n } 
of G where n > 2. The following process shows that there must be a root: 

1. SetP <- V. 

2. If there is a directed u-v path between two distinct vertices u and v in P, then we remove 
v from P. Equivalently, we set P <— P — {v }. We repeat this step as many times as 
possible. 

3. If there is only one vertex left in P, then it is the root. For other cases, there are at least 
two distinct vertices u and v in P and there is no directed path between them in either 
direction. Since G is quasi-strongly connected, from condition (iv) it follows that there 
is a vertex w and a directed w-u path as well as a directed w-v path. Since u is in P, w 
can not be in P. We remove u and v from P and add w, i.e. we set P <— P — {u, v} and 
P^PU {w}. Go back to step #2. 

4. Repeat as many times as possible. 

Every time we do this, there are fewer and fewer vertices in P. Eventually, we will get a root 
because there is a directed path from some vertex in P to every vertex we removed from P. □ 

The digraph G is a tree if G u is a tree. It is a directed tree if G u is a tree and G is quasi- 
strongly connected, i.e. it has a root. A leaf of a directed tree is a vertex whose out-degree is 
zero. 
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Example. 




Theorem 3.2. For the digraph G with n > 1 vertices, the following are equivalent: 

(i) G is a directed tree. 

(ii) G is a tree with a vertex from which there is exactly one directed path to every other vertex 
ofG. 

(iii) G is quasi- strongly connected but G — e is not quasi- strongly connected for any arc e in 
G 

(iv) G is quasi- strongly connected and every vertex of G has an in-degree of 1 except one 
vertex whose in-degree is zero. 

(v) There are no circuits in G (i.e. not in G n ) and every vertex of G has an in-degree of 1 
except one vertex whose in-degree is zero. 

(vi) G is quasi- strongly connected and there are no circuits in G {i.e. not in G u ). 

Proof. (i)=Kii): If G is a directed tree, then there is a root. This implies that there is a directed 
path from the root to every other vertex in G (but not more than one path since G u is a tree). 

(ii) =>(iii): If (ii) is true, then G obviously is quasi- strongly connected. We will prove by 
contradiction by considering the counter hypothesis: There is an arc e in G such that G — e is 
quasi- strongly connected. The arc e is not a loop because G is a directed tree. Let u and v be 
the two different end vertices of e. There does not exist a directed u-v path or a directed v-u 
path in G — e (otherwise G u would have a circuit). Therefore, there is a vertex w and a directed 
w-u path as well as a directed w-v path. However, this leads to the existence of two directed 
w-u paths or two directed w-v paths in G depending on the direction of the arc e. Then, there 
is a circuit in the tree G u . (y/ by Theorem 1.6). 

(iii) =Kiv): If G quasi-strongly connected, then it has a root r (Theorem 3.1) so that the in- 
degrees of other vertices are > 1. We start by considering the counter hypothesis: There exists 
a vertex v ^ r and d~(v) > 1. Then, v is the terminal vertex of two distinct arcs (u,v) and 
(w, v). If there were a loop e in G, then G — e would be quasi-strongly connected (-y/ ). Thus, 
u 7^ v with w 7^ v. Now, there are two distinct directed trails from r to v. The first one includes 
(u, v) and the second one includes (w, v). We have two possible cases: 
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r 




In the digraph on the left, the paths r-u and r-w do not include the arcs (u,v) and (w,v). 
Both G — (u, v) and G — (w, v) are quasi-strongly connected. In the digraph on the right, 
the r-u path includes the arc (w, v) or (as in the figure) the r-w path includes the arc (u, v). 
In either case, only one of G — (u, v) and G — (w, v) is quasi-strongly connected because 
the root is r (Theorem 3.1). (y/ ) We still have to show that d~(r) = 0. Let us consider the 
counter hypothesis: d~(r) > 1. Then, r is the terminal vertex of some arc e. However, the tree 
G — e is then quasi-strongly connected since r is its root (Theorem 3.1). (-y/ ) 

(iv) =Kv): If (iv) is true, then it is enough to show that there are no circuits in G u . The sum 
of in-degrees of all the vertices in G is n — 1 and the sum of out-degrees of all the vertices in G 
is also n — 1, i.e. there are n — 1 arcs in G. Since G is quasi-strongly connected, it is connected 
and it is a tree (Theorem 2.1). Therefore, there are no circuits in G u . 

(v) =Kvi): If we assume that (v) is true, then there are n — 1 arcs in G (compare to the 
previous proof). By Theorem 2.1, G is a tree. We denote by r the vertex satisfying condition 
(v). By Theorem 2.1, we see that there is exactly one path to any other vertex of G from r. 
These paths are also directed. Otherwise, d~(r) > 1 or the in-degree of some vertex on that 
path is > 1 or the in-degree of some other vertex other than r on that path is zero. Hence, r is a 
root and G is quasi-strongly connected (Theorem 3.1). 

(vi) =Ki): If G is quasi-strongly connected, then it has a root (Theorem 3.1). Since G is 
connected and there are no circuits in G, it is a tree. □ 

A directed subgraph T of the digraph G is a directed spanning tree if T is a directed tree 
and T includes every vertex of G. 



Theorem 3.3. A digraph has a directed spanning tree if and only if it is quasi-strongly con- 
nected. 

Proof. If the digraph G has a directed spanning tree T, then the root of T is also a root for G 
and it is quasi-strongly connected (Theorem 3.1). 

We now assume that G is quasi-strongly connected and show that it has a directed spanning 
tree. If G is a directed tree, then it is obvious. Otherwise, from Theorem 3.2, we know that there 



Example. 



G: 
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is an arc e in G so that if we remove e, G remains quasi-strongly connected. We systematically 
remove these kind of arcs until we get a directed tree. (Compare to the proof for Theorem 
2.2) " □ 



3.3 Acyclic Directed Graphs 

A directed graph with at least one directed circuit is said to be cyclic. A directed graph is acyclic 
otherwise. Obviously, directed trees are acyclic but the reverse implication is not true. 

Example. The digraph 




is acyclic but it is not a directed tree. 

Theorem 3.4. In an acyclic digraph, there exist at least one source (a vertex whose in-degree 
is zero) and at least one sink (a vertex whose out-degree is zero). 

Proof. Let G be an acyclic digraph. If G has no arcs, then it is obvious. Otherwise, let us 
consider the directed path 

which has the maximum path length k. Since G is acyclic, v io ^ v ik . If (v, v io ) is an arc, then 
one of the following is true: 

• v v it for every value of t — 0, . . . , k. Then, 

V, [V, ^j ) j V{ , 6j 1 , Ufa , €j 2 , . . . , €j k , Vi k 

is a directed path with length k + 1. y/ 

• v = v it for some value of t. We choose the smallest such t. Then, t > because there are 
no loops in G and 

is a directed circuit, yj 

Hence, dr{v i( ) = 0. Using a similar technique, we can show that d + (v ik ) = as well. □ 

If G = (V, E) is a digraph with n vertices, then a labeling of the vertices with an injective 
function a : V — > {1, . . . , n} which satisfies the condition a(u) < a(v) whenever (u, v) is an 
arc in G is known as topological sorting. 
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Theorem 3.5. We can sort the vertices of a digraph topologically if and only if the graph is 
acyclic. 

Proof. If the digraph is cyclic, then obviously we can not sort the vertices topologically. 
If the digraph G is acyclic, then we can sort the vertices in the following manner: 2 

1. We choose a vertex v which is a sink. It exists by Theorem 3.4. We set ct(v) n, 
G G — v and n ^— n — 1 . 

2. If there is just one vertex v in G, set ct(v) <- 1. Otherwise, go back to step #1. □ 



2 This is known as Marimont's Algorithm. The algorithm itself contains other items, too. The original refer- 
ence is MARIMONT, R.B.: A New Method of Checking the Consistency of Precedence Matrices. Journal of the 
Association for Computing Machinery 6 (1959), 164-171. 



Chapter 4 

Matrices and Vector Spaces of Graphs 



4.1 Matrix Representation of Graphs 



The adjacency matrix of the graph G = (V, E) is an n x n matrix D = (dij), where n is the 
number of vertices in G, V = {v±, . . . , v n } and 

dij = number of edges between Vi and Vj. 

In particular, dy = if (v j, f^) is not an edge in G. The matrix D is symmetric, i.e. D 

Example. 



T 



D. 





D 



/ 2 1 \ 

2 10 10 

1 3 

10 

\ / 



Obviously, an adjacency matrix defines a graph completely up to an isomorphism. 
The adjacency matrix of a directed graph G is D = (dij), where 

d^ = number of arcs that come out of vertex Vi and go into vertex Vj. 

Example. 

/ 1 \ 
10 




D 




\ 2 1 1 / 



The all-vertex incidence matrix of a non-empty and loopless graph G = (V, E) is an n x m 
matrix A = (a^), where n is the number of vertices in G, m is the number of edges in G and 

1 if Vi is an end vertex of ej 
otherwise. 
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Example. e x 



A 





ei 


e2 


e 3 


e4 






/ 


1 


1 


1 





\ 


V\ 




1 


1 





1 




V2 










1 







V3 













1 




Vi 


V 














/ 


v 5 



The all-vertex incidence matrix of a non-empty and loopless directed graph G is A 
where 

1 if Vi is the initial vertex of ej 
— 1 if Vi is the terminal vertex of ej 
^ otherwise. 

Example. 



A 




«3A A<? 4 





ei 


e 2 


e 3 


e4 


e 5 




I 


1 


-1 


-1 


-1 


o\ 


Vl 




-1 


1 








-1 


V2 



















l>3 


\ 








1 


1 


1 J 





Since every column of an all-vertex incidence matrix contains exactly two non-zero num- 
bers, two ones, we can remove a row and still have enough information to define the graph. The 
incidence matrix of a graph is obtained by removing a row from the all- vertex incidence matrix. 
It is not unique because there are n possible rows to remove. The vertex corresponding to the 
row removed is called the reference vertex. 

Similarly, every column in the all- vertex incidence matrix of a digraph contains exactly two 
non-zero numbers, +1 and —1. We can remove a row from the all- vertex incidence matrix and 
obtain the incidence matrix. Notice that the rows of an all-vertex incidence matrix are linearly 
dependent because the sum of rows is a zero vector. 

Theorem 4.1. The determinant of an incidence matrix of a nontrivial tree is ±1, regardless of 
whether the tree is a directed graph or not. 

Proof. We use induction on n, the number of vertices in the tree. 
Induction Basis : n = 2 and it is obvious. 
Induction Hypothesis: The theorem is true for n < k. (k > 2) 
Induction Statement : The theorem is true for n — k + l. 

Induction Statement Proof : Let T be a tree which has k + 1 vertices and let A be an (arbi- 
trary) incidence matrix of T. T has at least two pendant vertices (Theorem 2.3). We choose a 
pendant vertex v,- L which is not the reference vertex of A and the edge e t which is incident on v j. 
Then, 

a it = (±)1 and = 0, when j ^ t. 
We expand the determinant of | A| by the i th row: 

|A| = (±)(-l)*+'|A'|, 
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where A' is the minor corresponding to a it . We write T' = T — i>, which is also a tree (Vi is a 
pendant vertex). We use the induction hypothesis to get | A'| = ±1 because A' is obviously an 
incidence matrix of T'. □ 

Corollary. If the digraph G has no loops, then the rank of its all-vertex incidence matrix is 
p(G). 

Proof. If we rearrange the rows or columns of the all-vertex incidence matrix, the rank of the 
matrix will not change. Let us rearrange the vertices and arcs to group them by components. 
Then, the all-vertex incidence matrix is a block diagonal matrix in which each block is an all- 
vertex incidence matrix of a component. 



1 st compo- 
nent 






2 nd compo- 
nent 



\ 



o 



o 



V 



k th compo- 
nent 



/ 



We denote ri\ as the number of vertices in the i th component. Every component has a spanning 
tree whose incidence matrix has a nonzero determinant by Theorem 4.1, i.e. the matrix is not 
singular. The all-vertex incidence matrix of the i th component is obtained by adding columns 
and one row to an incidence matrix of the corresponding spanning tree. The row added is 
linearly dependent of other rows so that the rank of this matrix is the same as the rank of the 
incidence matrix (= — 1). Notice that in the special case when a component is trivial, the 
rank is zero =1 — 1. Therefore, 



rank of A = sum of the ranks of the components 

= (ni - 1) + • • • + K - 1) 

+ n k -k = p(G). 



= Hi + 
v 



□ 



= n 



Remark. From this proof we can also get a basis for the row space and the column space of 
the all-vertex incidence matrix. The columns corresponding to the branches of the spanning 
forest of G are a basis of the column space. We can get a basis of the row space by removing 
one row out of each component block. 



4.2 Cut Matrix 

If all the cuts of a nontrivial and loopless graph G = (V,E) are ii, . . . , I t , then the cut matrix 
of G is a t x m matrix Q = (qij), where m is the number of edges in G and 




1 if ej G Li (the cut is interpreted as an edge set) 
otherwise. 
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Example. For the graph 



v 




the cuts are I\ = {ei, e 4 }, / 2 = {e 2 , e 4 } and / 3 = {ex, e 2 



e%\. The cut matrix is 



ei e 2 e 3 e 4 



Q 



1 1 \ h 
0111/2 
1 1 1 ) h 



Remark. If the graph has n vertices, then it has |(2 n — 2) = 2 n ~ 1 — 1 cuts. Usually, there are 
not this many distinct edge sets. For the cut matrix, we only take one cut corresponding to an 
edge set so that there would not be repeated rows. Even so, there are usually too many rows. 

If G is a nontrivial and loopless digraph, then we assign an arbitrary direction to every cut 
(Vi, V2)' the orientation of (Vi, V 2 ) is from V\ to V 2 . In other words, we consider oriented cuts 
and we pick only one direction from the two possibilities. Then, the cut matrix Q = (g^) is 



the different cuts (interpreted as edge sets) are I\ = {ei, e 2 , e 3 , e 4 } (in the direction of ex), 
h = {^3, e 4 , e 5 } (in the direction ofe 3 ), / 3 = {ei, e 2 , e 5 } (in the direction of e\) and / 4 = 0. 
The cut matrix is 




1 if tj e Ii and they are in the same direction 
— 1 if Cj E Ii and they are in opposite directions 
otherwise. 



Example. For the digraph 




ei e 2 



e 3 e 4 e 5 



Q 



/ 1 -1 



1 -1 
\ 



1 -1 \h 
1 11/2 
001/3 
j h 
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Since ({v}, V — {v}) is a cut for every vertex v, rows of the all-vertex incidence matrix are 
rows of Q. If we are dealing with directed graphs, then these rows may have to be multiplied 
by -I. 

Theorem 4.2. Every row of the cut matrix of a digraph can be expressed in two different ways 
as a linear combination of the rows of the all-vertex incidence matrix. The non-zero coefficients 
are either all — +1 or all — — 1. 

Proof. Let Q be the cut matrix of a digraph G = (V, E) and let A be the all-vertex incidence 
matrix. Let (Vi, V 2 ) (note that it is oriented) be the cut corresponding to the i th row of Q. 
Reindexing if needed, we can assume that 

Vi = {v!, . . . , v r } and V 2 = {v r+ i, v n }. 

We write 

qj = i th row of Q and & t = t th row of A. 

We show that 

r n 
t=l t=r+l 

which proves the theorem. Let (v p , v q ) = be the k th arc of G. Then, 

a n h = k th element of the vector a„ = 1 , 



l pk = K cicmcui ui uic veciui a p 
i q k = k th element of the vector & q 



and 

a jk = if j ^ p, q. 

We get four cases: 

• v p EV 1 and v q G V 2 : Now p < r and q > r + 1 so q ik = 1 and 

r n 

qik = ^^atk = — ^ a tk . 

t=l t=r+l 

• v p eV 2 and v q e V\. Now p > r + 1 and q < r so q ik — —1 and 

r n 

Qik = ^2<^tk = — ^ a>tk- 

t=l t=r+l 

• Vp G V\ and v q G V\ : Now p < r and q < r so q ik = and 



— a tk — — dr+l,k — ■ ■ ■ — a nk . 



t=l 



• v p G Vi and v q G V 2 : Now p > r + 1 and q > r + 1 so q ik — and 



to— ^OiaH h^a^— — ^ a tfe . 

=0 =0 t=r+l 
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The statements above are valid for every k. □ 
Example. (Continuing from the previous example) The corresponding row of l\ is 
(1,-1,-1,-1,0) = (1,-1,-1,-1,0) = -(-1,1, 0,0,-1) -(0,0, 0,0,0) -(0,0, 1,1,1). 
Corollary. The rank of the cut matrix of a digraph G is p{G). 

Proof. The all-vertex incidence matrix A of G is also a submatrix of the cut matrix Q of G. 
Then, (by Corollary of Theorem 4. 1) 

rank(Q) > rank(A) = p(G). 

On the other hand, by Theorem 4.2, every row of Q can be expressed as a linear combination 
of the rows of A. Therefore, 

rank(Q) = rank(A) = p(G). 

□ 

Another consequence is that the cut matrix Q can be expressed as 

Q = AxA, 

where the elements of A x are or ±1. In addition, the matrix Ai can be constructed from the 
process in the proof of Theorem 4.2. 

If the graph G is connected, then it has a spanning tree T and an associated fundamental cut 
set. The fundamental cut sets are also cuts (when cuts are interpreted as edge sets). Therefore, 
the cut matrix Q of G has a submatrix Qf that corresponds to these fundamental cut sets. 
This matrix is called the fundamental cut set matrix. Similarly, the connected digraph G has 
a fundamental cut set matrix: if we interpret a fundamental cut set as a set, then the direction 
of the cut is chosen to be the same as the direction of the corresponding branch of T. If we 
rearrange the edges of G so that we have the branches first and sort the fundamental cut sets in 
the same order, then we get the fundamental cut set matrix in the form 

Qf = ( In-l | Qfc ) ? 

where I n _i is the identity matrix with n — 1 rows. The rank of Qf is thus n — 1 = p(G). 

Example. ( Continuing from the previous example) We left out vertex v 3 so we get a connected 
digraph. We choose the spanning tree 

e \ 

Vif — >«v 2 



T: e 3 



»v 4 



The fundamental cut sets are I 2 = {e 3 , e 4 , e 5 } (in the direction ofe 3 ) and I 3 = {ei, e 2 , e 5 } (in 
the direction of e\). Then, 
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Q f 



and 



Q 







^2 


o . 

c-4 




/ 


i n 
1 u 


1 

— 1 


u 


1 \ 


( 


n i 

U 1 


n 
u 


i 
i 


) 


ei 


e 2 


e 3 


e 4 


e 5 


1 


-1 


-1 


-1 











1 


1 




1 


-1 











/ 3 



4.3 Circuit Matrix 

We consider a loopless graph G = (V, E 1 ) which contains circuits. We enumerate the circuits of 
G: Ci, . . . , C e . The circuit matrix of G is an £ x m matrix B = where 

J 1 if the arc ej is in the circuit Cj 
10 otherwise 

(as usual, E = {e X: . . . ,e m }). 

The circuits in the digraph G are oriented, i.e. every circuit is given an arbitrary direction 
for the sake of defining the circuit matrix. After choosing the orientations, the circuit matrix of 
G is B = {bij) where 

{1 if the arc ej is in the circuit Cj and they in the same direction 
— 1 if the arc ej is in the circuit Ci and they are in the opposite direction 
otherwise. 

Example. For the directed graph 




the circuits are 




and the circuit matrix is 
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B 



ei 


e 2 


e 3 


e4 


1 





-1 


1 


-1 


1 





-1 





-1 


1 







If the graph G is connected and contains at least one circuit, then it has a cospanning tree 
T* and the corresponding fundamental circuits. By choosing the corresponding rows of the 
circuit matrix B, we get an (m — n + 1) x m matrix B f , called the fundamental circuit matrix. 
Similarly, a connected digraph G with at least one circuit has a fundamental circuit matrix: the 
direction of a fundamental circuit is the same as the direction of the corresponding link in T*. 

When we rearrange the edges of G so that the links of T* come last and sort the fundamental 
circuits in the same order, the fundamental circuit matrix takes the form 



B f 



( Bf t | I m _ n+ i ) 



where I m _ n+1 is the identity matrix with m—n+1 rows. The rank of B f is thus m—n+1 = fi(G) 
and the rank of B is > m — n + 1. 

Example. (Continuing from the previous example) We left out vertex v% so we get a connected 
digraph (see p. 34) and we chose the spanning tree 



T: e 3 



v 4 



The fundamental circuits are 



Z> v 2 CfV 



e 3 M M^ 4 

V 




and 



B f 



ei 


e 3 


e 2 


e 4 


e 5 







1 







(i 


-1 





1 


i) 




-1 
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Theorem 4.3. An oriented cut and an oriented circuit of a digraph have an even number of 
common arcs. Half of these arcs have the same direction in the cut and in the circuit, and the 
remaining arcs have opposite directions in the cut and in the circuit. 

Proof. Compare to the proof of Theorem 2.6. □ 
Theorem 4.4. For a digraph, BQ T = O (zero matrix). 

Proof. By the previous theorem, half of the nonzero numbers in the dot product corresponding 
to each element of BQ T are +1. The remaining nonzero numbers are —1. Therefore, the dot 
product is = 0. □ 

Theorem 4.5. If the digraph G contains at least one circuit, then the rank of its circuit matrix 
B is fi(G). Furthermore, if G is connected, then the circuit matrix B can be expressed as 
B = B 2 B f , where the matrix B 2 consists ofO 's and ±1 's, and the cut matrix Q can be expressed 
as Q = QiQf, where the matrix Qi consists ofO 's and ±1 's. 

Proof. First we consider the case when G is connected. We choose a spanning tree T of G and 
rearrange the m edges of G so that the branches of T come first and the links of T* come last. 
We sort the fundamental cut sets in the same order as the branches and links. Then, 

Q f = ( I„_i | Q fc ) and B f = ( B ft | I m _ n+ i ) . 

The blocks of B can be constructed in a similar way: 

B = ( Bi | B 2 ) . 

Since Q f is a submatrix of Q and B f is a submatrix of B, it follows from Theorem 4.4 that 

O = BfQj = ( B ft | I m _ n+ l ) ( I n _l | Qfc ) = ( Bf t | l m -n+l ) ^ ^ 

= B ft I n _ 1 + I m _ n+1 Qj c = B ft + Q£. 

Hence 

Bft = -Qfc. 

Furthermore, since Q f is a submatrix of Q, we can use the same theorem to get 

,T / „ I „ \ ( In-1 



O = BQ^ = ( Bf | B 2 ) ( I n _! | Qfc y = ( Bf | B 2 ) 



Qfc 



Bil„f + B 2 Q£ = Bf — B 2 B ft . 



Cfc 

Hence 

B = ( B 2 B ft | B 2 ) = B 2 ( B ft | I m _ n+ f ) = B 2 B f , 

as claimed. In the same way, Q can be expressed as Q = QfQf, as claimed, which is clear 
anyway since the rank of Q is n — 1 and its elements are 0's and ±l's. 

Every row of B is a linear combination of the rows corresponding to the fundamental circuits 
and the rank of B is at most equal to the rank of Bf = m — n + 1. On the other hand, as we 
pointed out earlier, the rank of B is > m — n + 1. Thus, rank(B) = m — n + 1 (= ^(G)) for a 
connected digraph. 

In the case of a disconnected digraph G (which contains at least one circuit), it is divided into 
components (k > 2 components) and the circuit matrix B is divided into blocks corresponding 
to the components (compare to the proof of the corollary of Theorem 4.1), in which case 

k 

rank(B) = " s ^(m i — rii + 1) = m — n + k = /i(G). □ 

i=i 
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Notice that the proof also gives the formula, B ft = — Qj c , which connects the fundamental 
cut matrix and the fundamental circuit matrix. 



4.4 An Application: Stationary Linear Networks 

A stationary linear network is a directed graph G that satisfies the following conditions: 

1 . G is connected. 

2. Every arc of G belongs to some circuit and there are no loops in G. 

3. Every arc ej in G is associated with a number ij called the through-quantity or flow. If 
there are m arcs in G, then we write 

/ h 

i = 



V 



(through-vector). 



4. Every vertex Vi in G is associated with a number p i called the potential. Furthermore, the 
across-quantity or potential difference of the arc e,j = (v^ , v i2 ) is 

Uj Pi 2 p^ . 

If there are n vertices and m arcs in G, then we write 



/Pi \ 



/ Mi 

and u = j 



\Pn ) 

(potential vector and across-vector). (Potentials are rarely needed.) 

5. Every arc Cj is one of the following: 

(a) a component 1 , for which there is an associated number rj. rj is constant 0) 
(stationarity) and the following equation links the quantities: 

Uj = ijTj (linearity). 

(b) a through-source, for which the through-quantity ij is fixed. 

(c) an across-source, for which the across-quantity Uj is fixed. 

6. (Kirchhoff's Through-Quantity Law) The sum of the through-quantities of an oriented cut 
of G is zero when the cut is interpreted as an edge set and the sign of a through-quantity 
is changed if the directions of a cut and an arc are different. 

7. (Kirchhoff's Across-Quantity Law) The sum of the across-quantities of an oriented circuit 
of G is zero when the sign of an across-quantity is changed if the directions of a circuit 
and an arc are different. 



'Not to be confused with a component of a graph! 
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Example. A typical stationary linear network is an electrical circuit with linear resistors, con- 
stant current sources and constant voltage sources. The components are resistors and rj are the 
resistances. Equation 5. (a) is Ohm's Law. 

We take a spanning tree T of a stationary linear network G, its fundamental cut matrix Q f 
and its fundamental circuit matrix B f . Let us rearrange the arcs in these matrices and vectors i 
and u like we did before. That is, the branches of T will come first followed by the links of T*. 
Kirchhoff 's Laws can then be written as 

Qi = and Bu = 0. 

On the other hand, the rows of the fundamental cut matrix Q f span all the rows of Q, and 
similarly rows of the fundamental circuit matrix B f span the rows of B. Then, Kirchhoff's 
Laws can also be written as 

Q f i = n i and B f u = m _„+i. 
Let us form the diagonal matrices K = \k 1: . . . , k m \ and L = \£ 1: . . . , £ m \ , where 

-Tj if e 3 -is a component j 1 if is a component 

kj — ^1 if ej is a through-source and £j — < if ej is a through-source 
if ej is an across-source [ 1 if ej is an across-source, 

and the m-vector s = (si, . . . , s m ) T , where 

if ej is a component 
Sj = ^ ij if ej is a through- source 
Uj if ej is an across-source. 

Then, all the information can be expressed as a system of linear equations 



K 


L 


Qf 


0(n— l)xm 


0(m-n+l)xm 










71—1 



0m— n+1 

known as the fundamental equations. The through and across quantities can be solved (ideally) 
if Tj and the sources are given. 

Remark. The same procedure can be applied to form state {differential) equations for dynamic 
networks, which have nonstationary components. 

The matrix of this system of linear equations does not have to be nonsingular and the system 
does not even have to have a unique solution at all. For example, in the matrix above, we can 
easily see that it is singular if some circuit only consists of across-sources or if some cut only 
consists of through- sources. As a matter of fact, this is the only case when the through and 
across quantities are not defined uniquely if the constants Tj are real numbers with the same 
sign (and often otherwise too). 

We choose a specific spanning tree T to explore these concepts more carefully: 

Lemma. If no cut of G consists of only through- sources and no circuit of G consists of only 
across-sources, then G has a spanning tree T such that every across-source is a branch ofT 
and every through-source is a link ofT*. 
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Proof. If G satisfies the hypothesis, then we first choose a digraph M which has every vertex 
and across-source (arc) of G. There are no circuits in this digraph. Then we add components to 
M one by one and try to come up with a spanning tree. If this fails at some point, then G has a 
cut with only through-sources, which is impossible. □ 

Now let us assume that no cut of G consists of only through-sources and no circuit of G 
consists of only across-sources. We use the spanning tree T mentioned in the lemma. We 
rearrange the arcs of G so that (as before) the branches of T come first. Within these branches, 
the across-sources come first followed by components. Similarly, the links are rearranged so 
that the components come first and the through-sources come last. 

The system of 2m equations can then be written as 



/ 


o 


o 


o 





I 


O 


O 


O 


\ 


/ ii \ 




o 


-Ri 


o 





O 


I 


O 


O 




h 




o 


o 


-R 2 





O 


O 


I 


O 




h 




o 


o 


o 


I 


O 


O 


O 


O 




u 




i 


o 


Qn 


Ql2 


O 


O 


O 


O 




Ui 




o 


I 


Q21 


Q22 


O 


O 


O 


O 




u 2 




o 


o 





O 


B n 


B12 


I 


O 




u 3 


V 


o 


o 





O 


B21 


B22 


O 


I 


/ 


\u 4 J 






s 2 




V J 



<— across-sources (in branches) 

<— components (in branches) 

<— components (in links) 

<— through-sources (in links) 

^— fundamental cut sets (across-sources) 

^— fundamental cut sets (components) 

<— fundamental circuits (components) 

<— fundamental circuits (through-sources) 



where the I's are identity matrices of the right dimensions, the O's are zero matrices of the right 
dimensions and the O's are zero vectors of the right dimensions. 

Remark. Here we assume that G has all these four types of arcs (across-source branch, com- 
ponent branch, through-source link and component link). In other cases (for example, when 
there are no through-sources), we leave the corresponding rows, columns and elements out of 
the system of equations. Other cases are treated in a similar way. 



Solving the equations, we get 



Ui = Si 



u 2 = Rii 2 



U3 — R2I3 



i 4 = s 2 



which leaves this system of equations: 



I 

O 
O 



o 
I 

B12R1 
B 22 Ri 



Q11 
Q21 
R 2 
O 



\ 

o 
o 

1 / 



/ ii \ 
i 2 
13 

Vu 4 / 



/ Q12S2 \ 
Q22S2 
BnSi 
V B21S1 J 



Thus, 



ii = -Q11I3 - Qi 2 s 2 , i 2 = -Q21I3 - Q2 2 s 2 , u 4 = -B22R1I2 - B21S1. 
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Bn 


B12 


B21 


B22 



-Ql 



Qn 


Q12 


Q21 


Q22 



-Qn 


Q21 


~~ Q12 


— Q22 



From the results in the previous section, we get 
B ft = 

Therefore, B n = — Qn and B 12 = — Q^i- Finally, there is only one set of equations for i 3 : 

(Q21R4Q2I + R2)i3 = Qn s l — Q2lR'lQ22S2- 

The matrix 2 of this system of equations can written as 

QJ 1 R 1 Q 21 + R 2 = ( Ql 1 1 ) (^R7") (-%") • 

We can see that it is not singular if the diagonal elements of Ri and R 2 are all positive or all 
negative. 

Therefore, we get 

Theorem 4.6. If the constants rj are real numbers with the same sign, then the fundamental 
equations of the stationary linear network have a unique solution exactly when no cut of the 
network consists of only through-sources and no circuit of the network consists of only across- 
sources. 

From the theorem above, we notice that the number of equations we have to solve (numerically) 
is considerably fewer than 2m. 

Example. A mono-frequency AC circuit with passive elements (resistors, capacitors and induc- 
tors) can also be modelled as a stationary linear network (Theorem 4.6 does not apply). In the 
circuit below, the component values are R — 10 Q, C — 100 fiF, L — 10 mH and the current 
source is 

I = 10cos(1000t) A. 




The complex current of the source is 10e j1000 *, where j is the imaginary unit. The (angular) 
frequency is u = 1000 rad/s. There are no voltage sources. The corresponding digraph is 



2 This matrix is called the admittance matrix. Similarly, the impedance matrix can be constructed from the 
blocks of the fundamental circuit matrix 
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The voltages and currents written as complex exponentials are ik = I k e^ imot anduu = L^e- 71000 *. 
In particular, the current source is i$ = S5 = lOe-^ 1000 *. We get r^from the familiar formulae 
from electrical circuit analysis: 

n = r 4 = R = 10 , r 3 = — — = -lOj , r 2 = juL = lOj. 

We choose the arcs t\ and e 2 as the branches of the spanning tree T. Because of the linearity 
of the system of equations, the exponential factors e jl00 °* cancel out and we get 
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Notice that we have left out the across-sources because there are none. This system is easily 
solved using computer programs, e.g. MATLAB: 
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»UV=inv (H) *s; 

»[UV angle (UV) abs (UV) ] 



ans = 



-6.0000 + 2.0000i 2.8198 

-2 .0000 + 4 . OOOOi 2 .0344 

-2.0000 - 6. OOOOi -1.8925 

-2 . 0000 + 4 . OOOOi 2 .0344 

10.0000 

-60.0000 +20. OOOOi 2.8198 



6.3246 
4 . 4721 
6.3246 
4 .4721 
10.0000 
63.2456 
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-40.0000 -20.0000i -2.6779 44.7214 

-60.0000 +20.0000i 2.8198 63.2456 

-20.0000 +40.0000i 2.0344 44.7214 

-60.0000 +20.0000i 2.8198 63.2456 

Thus, for example, the complex voltage across the current source is 

u 5 = = 63.25e^ 1000 * +2 - 82 ) 

and the real voltage is 63.25 cos(1000t + 2.82) V. 

Kirchhoff 's Through-Quantity Law can also be written in the form 

Ai = n , 

where A is the all-vertex incidence matrix of G. Furthermore, 

A T p = — u. 

Hence 

u • i = u T i = — p T Ai = 0. 

This result only depends on the structure of the digraph G (through the all-vertex incidence 
matrix). Now we get the famous theorem: 

Theorem 4.7. (Tellegen) If two stationary linear networks have the same digraph with corre- 
sponding through-vectors ii and i 2 as well as corresponding across-vectors Ui and u 2 , then 

Ui • i 2 = and u 2 • ii = 0. 

If we apply this to the case when the two networks are exactly the same (= G), then we get 

u • i = 0, 

known as the Law of Conservation of Energy. 

Remark. More details on this subject can be found e.g. in Swamy & Thulasiraman or 
Vago, as well as Dolan & Aldous. 

4.5 Matrices over GF(2) and Vector Spaces of Graphs 

The set {0, 1} is called afield (i.e. it follows the same arithmetic rules as real numbers) if 
addition and multiplication are defined as follows: 
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In this case -1 = 1 and l" 1 = 1. This is the field GF(2). 

If we think of the elements and 1 of the all-vertex incidence, cut, fundamental cut, circuit 
and fundamental circuit matrices of a ("undirected") graph as elements of the field GF(2), then 
Theorems 4.1, 4.2, 4.4, 4.5 and their corollaries also apply to "undirected graphs". (Keep in 
mind that — 1 = 1 in the field GF(2).) The proofs are the same. 
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For "undirected" graphs, the vector spaces are over the field GF(2). For directed graphs, the 
vector spaces are real (i.e. over the field W). The row space of the cut matrix of a (di)graph is 
the cut space. Similarly, the row space of the circuit matrix is the circuit space. The dimension 
of the cut space is the rank of the (di)graph and the dimension of the circuit space is the nullity 
of the (di)graph. Furthermore, the cut space and the circuit space are orthogonal complements. 
(All of these statements follow directly from the results above.) 

Often, we deal with the above mentioned spaces through subgraphs, i.e. we identify a vector 
with the subgraph generated by the corresponding arcs. In the case of "undirected" graphs, the 
addition of GF(2) vectors corresponds to the ring sum operation. 



Chapter 5 
Graph Algorithms 



5.1 Computational Complexity of Algorithms 

The complexity of a problem is related to the resources required to compute a solution as a 
function of the size of the problem. The size of a problem is measured by the size of the input N, 
and the resources required are usually measured by time (number of steps) and space (maximum 
amount of memory measured appropriately). Decision problems or yes-or-no questions are very 
common. Read Hopcroft & Ullman for classical complexity theory. 

To make computational complexities comparable, we need to agree on some specific math- 
ematical models for algorithms. For example, consider computing with Turing Machines and 
refer to courses in Formal Languages, Theory of Automata and Mathematical Logic. We have 
deterministic and nondeterministic version of algorithm models. In the deterministic version, 
there are no choices to be made. In the nondeterministic version, there is a choice to be made 
somewhere on the way. For a nondeterministic algorithm, we have to make the following as- 
sumptions so that we can actually solve problems: 

1 . The algorithm terminates at some point no matter how we choose the steps. 

2. The algorithm can terminate without yielding a solution. 

3. When the algorithm terminates and yields a solution, the solution is correct (it is possible 
to have more than one solution). 

4. For decision problems, if the algorithm fails to give a positive answer (yes), then the 
answer is interpreted to be negative (no). 

5. If the problem is to compute a value, then the nondeterministic algorithm has to give a 
solution for every input (value of the function). 

Nondeterministic algorithms are best treated as verification procedures for problems rather than 
procedures for producing answers. 

Computational complexity is considered asymptotically, that is for large problems, time 
or space complexities that differ by constant coefficients are not distinguished because linear 
acceleration and compression of space are easy to perform in any kind of algorithm model. 
Although the choice of an algorithm model has a clear impact on the complexity, it is not an 
essential characteristic, i.e. it does not change the complexity class. Often, we use the big-0 
notation for complexities. 0(f(N)) refers to the class of functions g(N) such that if iV > N 
holds, then \g(N)\ < Cf(N) holds, where C is a constant. 
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Without exploring algorithm models any further, we define a couple of important complex- 
ity classes. The time complexity class V {deterministic polynomial time problems) consists of 
problems of (input) size N where it takes at most p(N) steps to solve the problem using deter- 
ministic algorithms. p(N) is some problem dependent polynomial of N. The time complexity 
class MV (nondeterministic polynomial time problems) consists of problems of size N where it 
takes at mostp(iV) steps to solve the problem using nondeterministic algorithms. Once again, 
p(N) is some problem dependent polynomial of N. 

Time complexity class co-NT {complements of nondeterministic polynomial time prob- 
lems) consists of decision problems whose complements are in J\fV. (The complement of a 
problem is obtained by swapping the positive and the negative answer.) 

Obviously, V C MV and (for decision problems) V C co—AfV. Whether or not the inclu- 
sion is proper is an open problem, actually quite a famous problem. It is widely believed that 
both of the inclusions are proper. It is not known if the following holds for decision problems: 

MV = co-AfV or V = MV n co-AfV 

Most researchers believe that they do not hold. 

The space complexity class VSVACS {deterministic polynomial space problems) consists 
of problems of (input) size N where it takes at most p(N) memory units to solve the prob- 
lem using deterministic algorithms. p(N) is some problem dependent polynomial of N. The 
space complexity class AfVSVACS {nondeterministic polynomial space problems) consists of 
problems of size N where it takes at most p(N) memory units to solve the problem using non- 
deterministic algorithms. Once again, p(N) is some problem dependent polynomial of N. It is 
known that 

MV c VSVACS = AfVSVACS, 

but it is not known whether the inclusion is proper or not. 

An algorithm may include some ideally generated random numbers. The algorithm is then 
called probabilistic or stochastic. The corresponding polynomial time complexity class is BVV 
{random polynomial time problems or bounded-error probabilistic polynomial time problems). 
Some stochastic algorithms may fail occasionally, that is, they produce no results and terminate 
prematurely. These algorithms are called Las Vegas algorithms. Some stochastic algorithms 
may also produce wrong answers (ideally with a small probability). These kind of algorithms 
are called Monte Carlo algorithms. Some stochastic algorithms seldom yield exact solutions. 
Nevertheless, they give accurate approximate solutions with high probability. These kind of 
algorithms are called approximation algorithms. 

The task of an algorithm may be to convert a problem to another. This is known as reduction. 
If problem A can be reduced to another problem B by using a (deterministic) polynomial time 
algorithm, then we can get a polynomial time algorithm for problem A from a polynomial time 
algorithm for B. A problem is AfV-hard if every problem in J\fV can be reduced to it by a 
polynomial time algorithm. A/'P-hard problems are AfV -complete if they are actually in J\fV. 
MV -complete problems are the "worst kind". If any problem in MV could be shown to be 
deterministic polynomial time, then every problem in MV would be in V and V = MV . Over 
one thousand A/'P-complete problems are known currently. 

The old division of problems into tractable and intractable means that V problems are 
tractable and others are not. Because we believe that V ^ AfV in general, A/'P-complete 
problems are intractable. In the following, graph algorithms are either in V or they are ap- 
proximations of some more demanding problems. The size of an input can be for example the 
number of nonzero elements in an incidence matrix, the number of vertices n or the number of 
edges m or some combination of n and m. 
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5.2 Reachability: WarshalPs Algorithm 

We only deal with directed graphs in this section. The results also hold for "undirected" graphs 
if we interpret an edge as a pair of arcs in opposite directions. 

Problem. We are given an adjacency matrix of the digraph G = (V, E). We are to construct 
the reachability matrix R = (r^) ofG, where 




lifG has a directed Vi-Vj path 
otherwise. 



(Note that V = {vi, . . . ,v n }.) In particular, we should note that if ra = 1, then Vi is in a 
directed circuit. 

WarshalV s Algorithm constructs a series ofnxn matrices E l5 . . . , E n where 

1 . elements of Ej are either zero or one. 

2. Ej < Ej + i (i — 0, . . . , n — 1) (comparison is done element by element). 

3. E is obtained from the adjacency matrix D by replacing the positive elements with ones. 

4. E n = R. 

The algorithm is presented as a pseudocode: 

procedure Warshall 
begin 

E := Eg 

for % := 1 to n do 
for j :— 1 to n do 

if (E)ji = 1 then for k := 1 to n do 

(E) jfc := max((E) jfc , (E) ifc ) 

fi 

od 

od 
end 

In this case, the maximizing operation is sometimes called the Boolean sum: 



max 





1 








1 


1 


1 


1 



Let us show that Warshall's Algorithm gives us the desired results. Let Ej denote the value 
of E after % steps. 

Statement, (i) If there is a directed path from v s to v t such that apart from v s and v t , the path 
only includes vertices in the set {v±, . . . , Vj}, then (Ej) st = 1. 

(ii) If vertex v s belongs to a directed circuit whose other vertices are in the set {vi, . . . , v «}, 
then (Ei) ss = 1. 
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Proof. We will use induction on i. 

Induction Basis : i = 1. (Ei) st = 1 if (E ) s t = 1, or (E ) s i = 1 and (E )it = 1. We have 
one of the following cases: 



V 




s 



Induction Hypothesis: The statement is true for i <£.(£> 2) 
Induction Statement : The statement is true for % — L 

Induction Statement Proof : Let us handle both statements together. The proof for (ii) is 
given in square brackets. We have two cases: 

• vt belongs to the directed path [resp. directed circuit] but £ ^ s,t [resp. £ ^ s]. Then, we 
use the Induction Hypothesis: 

(E £ _x) s£ = 1 and (E^_ x ) ft = 1 [resp. (E*_i) a < = 1 and (E^_ x ) fa = 1], 

so (E e ) st = 1 [resp. (E £ ) ss = 1]. 

• ve is either v s or v t [resp. v t is v s ] or it does not belong to the directed path [resp. directed 
circuit] at all. Then, by the Induction Hypothesis 

(E^_i) st = 1 [resp. (E £ _i) ss = 1], 

so (E e ) st = 1 [resp. (E e ) ss = 1]. □ 

In Warshall's Algorithm, the maximizing operation is performed at most n 3 times. 



5.3 Depth-First and Breadth-First Searches 

Problem. We have to traverse through a (di) graph to find some kind of vertices or edges. 

We assume that the (di)graph is connected and loopless. For disconnected graphs, we have to 
go through the components separately. We ignore loops if the (di)graph has any. 

Depth-First Search, DFS, has many uses. The procedure is a bit different for undirected 
graphs and directed graphs. Therefore they will be treated separately. 



Undirected Graphs 

We choose a starting vertex r (root) to start the search. Then, we traverse an edge e = (r, v) 
to go to the vertex v. At the same time, we direct e from r to v. Now, we say that the edge e 
is examined and we call it a tree edge. The vertex r is called the father of v and we denote it 
r = FATHER(w). 

We continue the search. At a vertex x, there are two cases: 
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(1) If every edge incident to x has been examined, return to the father of x and continue the 
process from FATHER(x). The vertex x is said to be completely scanned. 

(2) If there exist some unexamined edges incident to x, then we choose one such edge e = 
(x, y) and direct it from x to y. This edge is now said to be examined. We have two 
subcases now: 

(2.1) If y has not been visited before, then we traverse the edge (x, y), visit y and continue 
the search from y. In this case, e is a tree edge and FATHER(y) = x. 

(2.2) If y has been visited before, then we select some other unexamined edge incident to 
x. In this case, the edge e is called a back edge. 

Every time we come to a new vertex which has never been visited before, we give it a distinct 
number. The number of the root is 1 . We write 

DFN(x) = running number of vertex x. 

A complete DFS ends when we traverse back to the root and we have visited every vertex or 
when we have found the desired edge/vertex. 

DFS divides the edges of G into tree edges and back edges. Obviously, the tree edges form 
a spanning tree of G, also known as a DFS tree. If we include the directions of the tree edges, 
we get a directed DFS tree. DFS gives a direction to every edge in G. When we use these 
directions, we get a digraph whose underlying subgraph is G. It has the DFS tree as a directed 
spanning tree. 

Example. For the graph 




we start the DFS from a root in the upper left corner. The back edges are marked with two lines. 
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In the following, we denote, 

. [ if vertex x has not been visited 
K[x) = < 

I 1 if vertex x has been visited 

and TREE and BACK are set variables containing the directed tree edges and back edges. 
Depth-First Search for Graphs: 

1. Set TREE <- 0, BACK <- and % 4- 1. For every vertex x of G, set FATHER(x) <- 
and K{x) 4- 0. 

2. Choose a vertex r for which K(r) = (this condition is needed only for disconnected 
graphs, see step #6). Set DFN(r) «— i, K{r) 1 and u ^— r. 

3. If every edge incident to u has been examined, go to step #5. Otherwise, choose an edge 
e = (u, v) that has not been examined. 
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4. We direct edge e from u to v and label it examined. 

4.1 lfK(v) = 0, then set i <- i + 1, DFN(v) <- i, TREE <- TREE U {e}, K(v) <- 1, 
FATHER(v) <r- u and u <- v. Go back to step #3. 

4.2 If = 1, then set BACK <- BACK U {e} and go back to step #3. 

5. If FATHER(w) ^ 0, then set u <- FATHER(w) and go back to step #3. 

6. (Only for disconnected graphs so that we can jump from one component to another.) If 
there is a vertex r such that K(r) = 0, then set % <— % + 1 and go back to step #2. 

7. Stop. 

We denote T as the DFS tree and G as the directed graph obtained from the algorithm. T 
is a directed spanning tree of G. If there is a directed path from u to v in T, then we call u an 
ancestor of t> and t> a descendent of w. Vertices -u and t> are related if one of them is an ancestor 
of the other. In particular, if (u, v) is an edge of T, then u is the father of v and v is a son of u. 
An edge (u, t> ) of G, where u and v are unrelated, is called a cross edge. However, 

Statement. Cross edges do not exist. 

Proof. Let u and v be two distinct vertices which are unrelated. Then, (by quasi-strong connec- 
tivity) there are two vertices u' and v' such that 

• FATHER(w') = FATHER^'), 

• u' = u or u' is an ancestor of u and 

• v ' = v or v 1 is an ancestor of v. 

We examine the case where DFN(u') < DFN(w') (the other case is obviously symmetrical). 
We label T x as the directed subtree of T whose root is v! and T 2 as the directed subtree of T 
whose root is v'. Obviously, DFS goes through the vertices of T 2 only after u' is completely 
scanned. Furthermore, v! is completely scanned only after all the vertices of T\ are completely 
scanned. Hence, it is impossible to have an edge (u, v). □ 

Directed Graphs 

Depth-first search in a (connected and loopless) digraph G is similar to the case for undi- 
rected graphs. The algorithm divides the arcs in G into four different classes. If the search 
proceeds to an unexamined arc e = (x, y), then the four possible classes are: 

(1) If y has not been visited, then e is a tree edge. 

(2) If y has been visited, then there are three cases: 

(2.1) y is a descendent of x in the subgraph induced by existing tree edges. Then, e is a 
forward edge and DFN(y) > DFN(x). 

(2.2) x is a descendent of y in the subgraph induced by the existing tree edges. Then, e is 
a back edge and DFN(y) < DFN(x). 

(2.3) x and y are not related by any of the existing tree edges. Then, e is a cross edge 
and DFN(y) < DFN(x). (Note! It is impossible that DFN(y) > DFN(x). This is 
proven in the same way as we did previously.) 



CHAPTER 5. GRAPH ALGORITHMS 



57 



The directed subgraph of G induced by tree edges is called the DFS forest (directed forest). 

If DFN(y) > DFN(x) holds for the arc (x, y), then (x, y) is a tree edge or a forward edge. 
During the search, it is easy to distinguish the two because (x, y) is a tree edge if y has not been 
visited and it is a forward edge otherwise. If DFN(y) < DFN(x), then (x, y) is a back edge or 
a cross edge. During the search, it is easy to distinguish the two because (x, y) is a cross edge 
if y is completely scanned and it is a back edge otherwise. 

In the following, K, FATHER, TREE and BACK are defined as previously. We also have 
two new variables FORWARD and CROSS (their meanings are obvious) and 



Depth-First Search for Digraphs: 

1. Set TREE 4- 0, FORWARD 4- 0, BACK 4- 0, CROSS 4- and i 4- 1. For every 
vertex x in G, set FATHER(rr) 4- 0, K(x) 4- and L(x) 4- 0. 

2. Choose a vertex r such that K{r) = and set DFN(r) <— i, if (r) 1 and u 4— r. 

3. If every arc coming out of u has already been examined, then set L(u) 4— 1 and go to 
step #5. Otherwise, choose an unexamined arc e = (u, v). 

4. Label the arc e examined. 

4.1 If = 0, then set i <- i + 1, DFN(v) <- i, TREE <- TREE U {e}, K(v) <- I, 
FATHER^) i — u and u 4 — v. Go to step #3. 

4.2 If K(v) = 1 and DFN(-u) > DFN(u), then set FORWARD 4- FORWARD U {e} 
and go to step #3. 

4.3 If K{v) = 1 and DFN(-y) < DFN(m) and L(v) = 0, then set BACK 4- BACK U 
{e} and go to step #3. 

4.4 lfK(v) = 1 and DFN(-y) < DFN(m) and L(v) = 1, then set CROSS 4- CROSS U 
{e} and go to step #3. 

5. If FATHER(w) ^ 0, then set u 4- FATHER(m) and to go step #3. 

6. If there is a vertex r such that K(r) = 0, then set % <— i + 1 and go to step #2. 

7. Stop. 

Example. DFS in the following digraph starts from a root in the upper left corner and proceeds 
like this (back edges are marked with one line, cross edges are marked with two lines and 
forward edges are marked with three lines): 




1 if x is completely scanned 
otherwise. 
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Theorem 5.1. If a depth-first search in a quasi- strongly connected digraph starts from one of its 
roots, then the DFS forest is a directed tree. In particular, the DFS forest of a strongly connected 
digraph is a directed tree no matter where the search starts from. 

Proof. Let us prove by contradiction and consider the counter hypothesis: The DFS forest T 
resulted from a DFS in a quasi-strongly connected digraph G that began from root r is not a 
directed tree. 

Since T is a directed forest, the component Ti of T which has the root r does not contain 
some vertex v of G. On the other hand, there is a directed path from r to v. We choose the last 
vertex u on this path which is in Ti and the arc e = (u, w). Since the vertex w is not in Ti, 
the edge e is not a tree edge, a back edge nor a forward edge. Then, it must be a cross edge. 
Because the search began at r, the vertex w has to be in Ti (y/ ). 

Strongly connected digraphs are also quasi-strongly connected and any vertex can be chosen 
as a root. □ 

Breadth-first search, BFS, is related to DFS. Let us consider a connected graph G. 

Breadth-First Search for Graphs: 

1. In the beginning, no vertex is labeled. Set % <— 0. 

2. Choose a (unlabeled) starting vertex r (root) and label it with i. 

3. Search the set J of vertices that are not labeled and are adjacent to some vertex labeled 
with i. 

4. If J ^ 0, then set % <— % + 1. Label the vertices in J with % and go to step #3. 

5. (Only for disconnected graphs so we can jump from one component to another.) If a 
vertex is unlabeled, then set i <— and go to step #2. 

6. Stop. 

BFS also produces a spanning tree, called the BFS tree, when we take the edges 

(vertex labeled with i, unlabeled vertex) 

while forming J. One such tree edge exists for each vertex in J. We obtain the directed BFS tree 
by orienting the edges away from the labeled vertex to the unlabeled vertex. BFS as presented 
above does not however orient every edge in the graph. Obviously, the label of a vertex is the 
length of the shortest path from the root to it, in other words, the distance from the root. 

Example. BFS in the graph we had in the previous example starts at a root in the upper left 
corner and proceeds as follows. (Tree edges are marked with two cross lines.) 
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The corresponding BFS tree is 




We obtain the directed BFS tree by orienting the branches away from the root. 
BFS in a digraph G is very similar to what we just did. 

Breadth-First Search for Digraphs: 

1. In the beginning, no vertex is labeled. Set i <- 0. 

2. Choose an unlabeled starting vertex r (root) and label it with i. 

3. Search the set J of terminal vertices of arcs whose initial vertices have been labeled with 
% but whose terminal vertices have not been labeled. 

4. If J ^ 0, then set i 4— i + 1. Label the vertices in J with i and go to step #3. 

5. If not all vertices have been labeled, then set i <- and go to step #2. 

6. Stop. 
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BFS in a digraph produces a BFS forest (directed forest) when we take the examined arcs 

(vertices labeled withi, unlabeled vertices) 
while forming J. One such tree edge exists for each vertex in J. 
Remark. In addition, BFS can be modified to sort the arcs like DFS. 

5.4 The Lightest Path: Dijkstra's Algorithm 

Problem. The edges of a (di)graph are given non-negative weights. The weight of a path is 
the sum of the weights of the path traversed. We are to find the lightest (directed) path in the 
(di) graph from vertex u to vertex v u) if the path exists (sometimes also called the shortest 
path). We should state if such path does not exist. 

Obviously, we can assume that we do not have any loops or parallel edges. Otherwise, we 
simply remove the loops and choose the edge with the lowest weight out of the parallel edges. 
From now on, we only consider directed graphs. Undirected graphs can be treated in the same 
way by replacing an edge with two arcs in opposite directions with the same weight. 

We denote a(r, s) as the weight of the arc (r, s). Dijkstra's Algorithm marks the vertices as 
permanent or temporary vertices. The label of a vertex r is denoted f3(r) and we define 



7(r) 



1 if the label is permanent 
if the label is temporary. 



A permanent label f3(r) expresses the weight of the lightest directed u-r path. A temporary 
label /3(r) gives an upper limit to this weight (can be oo). Furthermore, we denote: 



7r(r) = 



the predecessor of vertex r on the lightest directed u-r path if such a path exists 
otherwise, 



so we can construct the directed path with the lowest weight. 
Dijkstra's Algorithm: 

1. Set f3(u) and ^(u) 1. For all other vertices r, set f3(r) oo and 7(7") 0. For 
all vertices r, we set n(r) 0. Furthermore, set w u. 

2. For every arc (w, r), where 7(7") = and f3(r) > /3(w) + a(w, r), set 

f3(r) <— f3(w) + a(w, r) and n(r) ^— w. 

3. Find a vertex r* for which 7(7-*) = 0, P(r*) < 00 and 

0(r*) = min {(3(r)}. 

7(r)=0 

Set 

7(7"*) ^— 1 and w ^— r*. 
If there is no such vertex r*, a directed u-v path does not exist and we stop. 
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4. If w 7^ f , then go to step #2. 



5. Stop. 



We see that the algorithm is correct as follows. We denote (for every step): 



Vi = {permanently labeled vertices} 
V 2 = {temporarily labeled vertices}. 



((Vi, V 2 ) is a cut with the completely scanned vertices on one side and other vertices on the 
other side.) 

Statement. The label f3(r) of the vertex r in V\ is the weight of the lightest directed u-r path 
and 7r(r) is the predecessor ofr on such a path. 

Proof. After step #2, the temporary label of r is always the weight of a directed u-r path with 
the lowest weight whose vertices are in V\ except for r (= oo if there is no such path), and ir(r) 
is a predecessor of r on this path (or = 0). This is because (two cases): 

• Before step #2, f3(r) = oo. The only "new" vertex in V\ is now w so every possible 
directed u-r path has to visit w. If there is no such path, then the case is obvious (/3(r) 
stays at oo and ix{r) stays at zero). Let us assume that we have the (lightest) directed 
u-r path that contains only vertices of V\ and r as well. In particular, w is included. The 
subpath from u to w has of course the lowest weight. We consider the vertex s (G Vi) 
which is the predecessor of r on the directed u-r path. If s = w, then the case is clear. If 
s ^ w, then s has been a w before, in which case f3(r) can not be = oo (step #2) ). 

• Before step #2, j3(r) < oo. Then, j3(r) is the weight of the lightest directed u-r path 
whose vertices are in V\ — {w} except for r. The only "new" vertex in V\ is w so every 
possible lighter directed u-r path has to visit w. If there is no such path, then the case 
is obvious (/3(r) and ir(r) remain unchanged). Let us assume that we have a (lighter) 
directed u-r path that contains only vertices of V\ and r as well. In particular, w is 
included. The subpath from u to w has of course the lowest weight. We consider the 
vertex s (G Vi) which is the predecessor of r on the directed u-r path. If s — w, then 
the case is clear. If s ^ w, then s is in V\ — {w}. Since s has been a w before, there is 
a lightest directed u-s path that does not contain w (otherwise, we should have chosen 
r* in step #3 to be some predecessor of s on the directed u-w-s path). Then, we get a 
directed u-r path with a lower weight that contains r and only vertices in Vi — {w} (y/ ). 

The permanent label is the weight we seek because of the minimization in step #3 and n(r) 
gives a predecessor of r as we claimed. □ 

At the end of the algorithm, vertex v gets a permanent label or the process stops at step #3 
(which means a directed u-v path does not exist). The directed path with the lowest weight can 
be obtained by starting from the vertex v and finding the predecessors by using the label it. 
If we replace step #4 by 

4'. Go to step #2. 

and continue the process until it stops at step #3, we get 




CHAPTER 5. GRAPH ALGORITHMS 



63 



and 



TC(W 



the predecessor of w on the lightest directed u-w path if there is one and w ^ u 
otherwise. 



Remark. Dijkstra 's algorithm may fail if there are negative weights. These cases are investi- 
gated in the next section. 



5.5 The Lightest Path: Floyd's Algorithm 

Problem. We are to find the lighest path from vertex u to vertex v u) in a digraph or to show 
that there is no such path when the arcs of the digraph have been assigned arbitrary weights. 
Note that the weight of a directed path is the sum of the weights of the arcs traversed. 

Obviously, we can assume there are no loops or parallel arcs. Otherwise, we simply remove the 
loops and choose the arc with the lowest weight out of the parallel arcs. Floyd's Algorithm only 
works for digraphs. We write the weight of (x, y) as a(x, y) and construct the weight matrix 
W = (wij) where 

{a(vi, Vj) if there is an arc hi, vA 
oo otherwise. 

(Once again, V = {v 1 , . . . , v n } is the vertex set of the digraph.) Floyd's Algorithm is similar 
to Warshall's Algorithm. It only works if the digraph has no negative cycles, i.e. no directed 
circuit in the digraph has a negative weight. In this case, the lightest directed path is the lightest 
directed walk. 

Floyd's Algorithm constructs a sequence of matrices W , Wi, . . . , W n where W = W 

and 

(Wfe)jj = weight of the lightest directed v~Vj path, 

where there are only vertices vi , . . . , on the path besides Vi and Vj 

( = oo if there is no such path). 
Statement. When is computed from by the formula 

(W k ) st = min{(W fc _i) st , (W k ^) sk + (W fc _i)fct}, 

then we get the previously mentioned sequence of weight matrices. If the digraph has negative 
cycles, then the sequence is correct up to the point when one of the diagonal elements turns 
negative for the first time. 

Proof. We use induction on k. 

Induction Basis : k — 1. Since the digraph is loopless, the diagonal elements of W can only 
be oo and the lightest directed path (if there is one) is one of the following, and the statement is 
obvious: 
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Induction Hypothesis: The statement is true for k <£.(£> 2) 
Induction Statement : The statement is true for k — £. 

Induction Statement Proof : The diagonal elements of W^_ x have to be nonnegative (oo is 
permitted) for us to get this k. Let us consider the case where s ^ t. (The case s = t is 
analogous.) We have five cases: 

• Vertex vg is on the lightest directed path but it is not v s or v t , i.e. £ ^ s,t. Let us 
consider the directed subpath from v s to vg whose vertices other than v s and vg are in 
{v i, . . . , ve-i}. Suppose the lightest directed v s -vg path of this kind has common vertices 
with the directed subpath from vg to v t other than vg itself, e.g. v p . The directed v s -v p - 
vg-v t walk we get would be lighter than the original directed v s -v t path. By removing 
cycles, we would get a directed v s -v t path that would be lighter and would only contain 
v s as well as v t and the vertices Vi, . . . ,vg (y/ ). (We have to remember that weights of 
cycles are not negative!) Therefore, the directed subpath from v s to vg is the lightest di- 
rected v s -vg path which contains the vertices v i, . . . , vg-i as well as v s and vg. Similarly, 
the directed subpath from vg to v t is the lightest directed vg-v t path which contains the 
vertices v i, . . . , vg_\ as well as v t and vg. Now, we use the Induction Hypothesis: 

(w,) si < (Wt-iU 

(check the special case (W^_i) st = oo) and 

(w,) si = (Wt-iU + (w^i) ft . 

• The directed v s -v t path with the lowest weight exists and vg = v s . By the Induction 
Hypothesis, (W^) si = (W M ) S( and 

since (Wg-x)gg > (possibly = oo). 

• The directed v s -v t path exists and vg = v t . By the Induction Hypothesis, (W^) si = 
(W^_i) si and 

(Wt-iU + (We-i)et = (We-i) s e + (W/_i)« > (Wg^) s g = (Wg^) st , 
since (W<_i)« > (possibly = oo). 

• The lightest directed v s -v t path exists but vg is not on the path. Now, we construct the 
lightest directed v s -vg path and the lightest vg-v t path which, in addition to the end ver- 
tices, contain only vertices v i, . . . , vg-±, if it is possible. By combining these two paths, 
we get a directed v s -v t walk. By removing possible cycles from this walk, we get an as 
light or even lighter v s -v t path, which only contains vertices v 1: . . . , vg as well as v s and 
v t . (We have to remember that weights of cycles are not negative!) Therefore, this is a 
case where 

(Vft-iU + (W/_i) ft > (Wg-jst 

and the equation in the statement gives the right result. If there is no directed v s -vg path 
or vg-v t path, then it is obvious. 

• The lightest directed v s -v t path does not exist. Then, (W^) si = oo and (W^_i) si = oo. 
On the other hand, at least one of the elements (W^_i)^ or (Wg-i)g t is = oo because 
otherwise we would get a directed v s -v t path by combining the v s -vg path with the vg-v t 
path as well as removing all possible cycles (^/ ). □ 
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Floyd's Algorithm also constructs another sequence of matrices Z , . . . , Z„ in which we 
store the lightest directed paths in the following form 



£ where t> e is the vertex following Vi on the lightest directed 

v—Vj path containing only vertices vi and i> ^ as well as Vi , . . . , v k 
(if such a path exists) 

otherwise. 



Obviously, 

(Zo) . . = / J if ( W ^ ^ 00 
[ otherwise. 

The matrix Z k (k > 1) of the sequence can be obtained from the matrix Z fe _i by 



(Zfe)jj — 



(Zfe-i)jfe if (W fc _i) ifc + (W fc _i) fcj < (W fc 
(Z k _x)ij otherwise, 



so the sequence can be constructed with the sequence W , W 1; . . . , W n at the same time. 
Finally, Floyd's Algorithm is presented in the following pseudocode. We have added a part 
to test if there are negative elements on the diagonal and the construction of the Z , . . . , Z n 
sequence of matrices. 



procedure Floyd 
begin 

W := W 

k := 

for % := 1 to n do 
for j :— 1 to n do 
if (W)jj = oo then 

(Z)y := 
else 

fi 

od 

od 

while k < n and Test(W) do 
Iteration(W, Z, k) 

od 
end 



subprocedure Test(W) 
begin 

for % := 1 to n do 
if (W)^ < then 
return FALSE 

fi 

od 

return TRUE 
end 
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subprocedure Iteration(W, Z, k) 
begin 

k := k + 1 
for i := 1 to n do 
for j :— 1 to n do 

if (W)* + (W) fcj < (W)y then 
(W) - := (W) ik + (W) kj 
(Z)ij := (Z) ik 

fi 

od 

od 
end 

5.6 The Lightest Spanning Tree: Kruskal's and Prim's Algo- 
rithms 

Problem. We have to find the spanning tree with the lowest weight of a connected graph if the 
edges of the graph have been weighted arbitrarily and the weight of a tree is the sum of all the 
weights of the branches. 

Obviously, we can assume that the graph G = (V, E) is nontrivial and simple. Otherwise, we 
simply remove the loops and choose the edge with the lowest weight out of the parallel edges. 
We denote the weight of the edge e as a(e) and the weight of the spanning tree T as 7(T). 
As usual, we write the number of vertices as n, number of edges as m, V — {v± : . . . , v n } and 
E = {d, . . . ,e m }. 

The distance between two spanning trees 7\ and T 2 of G is 

n - 1 - #(71 D T 2 ) =drf.d(Ti,T 2 ), 

where # (7\ flT 2 ) is the number of edges in the intersection of 7\ and T 2 . Obviously, d(Ti , T 2 ) = 
if and only if T\ = T 2 . If d(Ti, T 2 ) = 1, then T x and T 2 are neighboring trees. 

The spanning tree T of G is cut minimal if the weights of the edges of the fundamental 
cut set determined by the branch b are > a(b) for every branch b. Similarly, the spanning tree 
T is circuit minimal if the edges of the fundamental circuits are < a(c) for every link c in 
the cospanning tree T*. The spanning tree T is locally minimal if 7(T) < 7(T') for every 
neighboring tree T'ofT. 

Lemma. The following three conditions are equivalent for the spanning tree T: 

(i) T is cut minimal. 

(ii) T is circuit minimal. 

(iii) T is locally minimal. 

Proof. (i)=Kii): Let us assume T is cut minimal and let us consider a fundamental circuit C of 
G corresponding to the link c of the cospanning tree T*. Other than c, the branches in C are 
branches of T. Every such branch b defines a fundamental cut set of T, which also contains c 
(Theorem 2.7). Hence a (b) < a(c). 

(ii)=Kiii): Let us assume that T is circuit minimal and let us consider a neighboring tree V 
of T. T' has (exactly one) branch e! which is not in T, i.e. e! is a link of T*. We examine the 
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fundamental circuit C defined by e'. Not all edges of C are in V . We choose an edge e in C 
that is not in V . Then, e is a branch of T (actually the only branch of T that is not in T'). Now, 
we remove e out of T and add e' to T. The result has to be V . Because of circuit minimality, 
a(e') > a(e),i.e. ~f(T') > 7 (T). 

(iii)=Ki): We consider the locally minimal spanning tree T. We take an arbitrary branch 
b from T corresponding to a fundamental cut set / and an arbitrary link c ^ b in /. Then, 6 
belongs to the fundamental circuit of T defined by c (Theorem 2.8). By removing the branch 
b from T and adding the edge c to T, we get the neighboring tree T' of T. Because of local 
minimality, 7 (T) < 7(T'), i.e. a(c) > a(b). □ 

The spanning tree T is minimal if it has the lowest possible weight. 

Theorem 5.2. The following three conditions are equivalent for the spanning tree T: 

(i) T is cut minimal. 

(ii) T is circuit minimal. 

(iii) T z's minimal. 

Proof. By the lemma above, (i) and (ii) are equivalent. A minimal spanning tree is obviously 
locally minimal. Thus, it suffices to prove that a cut minimal spanning tree is also minimal. 
We will prove by contradiction and consider the counter hypothesis: There is a cut minimal 
spanning tree T which is not minimal. Let us consider the minimal spanning tree T' and choose 
T and T so that the distance d(T, V) is as small as possible. By the lemma, d(T, T') > 1. 

T has a branch e which is not in T', i.e. it is a link of (T")*. We label the fundamental 
cut set of T defined by e as I and the fundamental circuit of V defined by e as C. In the 
intersection I DC, there are also other edges besides e (Theorem 2.6). We choose such an edge 
e'. Then, e' is a link of T* and a branch of T". Since T is cut minimal, a(e') > a(e). Since 
T" is (circuit) minimal, a(e') < a(e). Therefore, a{e!) = a(e). By removing e! from T' and 
adding e to T', we get a minimal spanning tree T" which has the same weight as T' . However, 
d(T,T") < d(T,T'). y/ □ 

In Kruskal's Algorithm, the edges of the graph G (and their weights) are listed as e±, . . . , e m . 
The algorithm constructs a circuit minimal spanning tree by going through the list to take some 
edges to form the tree. This is especially effective if the edges are sorted in ascending order by 
weight. 

In the dual form of Kruskal's Algorithm, we construct a cut minimal spanning tree by going 
through the list of edges to take some edges to form the cospanning tree. Once again, this is 
especially effective if the edges are sorted in descending order by weight. 

In all, we get four different versions of Kruskal's Algorithm. (We have to remember that the 
subgraph induced by the edge set A is written as (A).) 

Kruskal's Algorithm No. 1 

Here we assume that the edges are given in ascending order by weight. 

1. Set k <- 1 and A <- 0. 

2. If efe does not form a circuit with the edges in A, then set A ^— A U {e^} as well as 
k <r- k + 1 and go to step #4. 

3. If efc forms a circuit with the edges in A, then set k <— k + 1 and go to step #4. 
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4. If (V, A) is not a tree, then go to step #2. Otherwise stop and output the spanning tree 

T=(A). 

Whenever we leave out an edge from A (step #3), its end vertices are already connected in A. 
Thus, the vertices of G are connected in T as they are in G. Since T is obviously circuitless 
(step #3), it is also a spanning tree of G. At each stage, the branches of the fundamental circuit 
defined by the link belonging to T* (step #3) are predecessors of that link in the list. Hence, T 
is circuit minimal and thus minimal. 

Remark. In every step, the branches and links are permanent. We do not have to know the 
edges beforehand as long as we process them one by one in ascending order. The rank of the 
graph (number of branches in a spanning tree) is then required beforehand so we know when to 
stop. 

Kruskal's Algorithm No. 2 

Here we assume the edges are given in an arbitrary order. 

1. Set k <- 1 and A «- 0. 

2. If (A U {e k }) contains no circuits, then set A A U {e k } as well as k k + 1 and go 
to step #4. 

3. If (A U {e k }) contains a circuit C, then choose the edge with the largest weight e in C (if 
there are more than one, take any), set A (A U {e k }) — {e} as well as k k + 1 and 
go to step #4. 

4. If k < m, then go to step #2. Otherwise, stop and output the spanning tree T = (A). 

Whenever we leave out an edge from A (step #3), its end vertices are already connected in A. 
Thus, the vertices of G are connected in T as they are in G. Since T is obviously circuitless 
(step #3), it is a spanning tree of G. 

We see that T is circuit minimal (and minimal) by the following logic. During the whole 
process, (A) is a forest by step #4. In addition, if u and w are connected in (A) at some point, 
then they are also connected afterwards. The u-w path in (A) is unique but it can change to 
another path later in step #3. Nevertheless, whenever this change occurs, the maximum value 
of the weights of the edges of the path can not increase anymore. Every link c of T* has been 
removed from A in step #3. Then, the weight of c is at least as large as the weights of the other 
edges in C, After we have gone through step #3, the only connected end vertices of c in (A) 
have to go through the remaining edges of C. The final connection between the end vertices of 
c in T goes through the edges of the fundamental circuit defined by c. Therefore, the weights 
of the edges of this fundamental circuit are < ct(c). 

Remark. In each step, the links ( e in step #3 ) are permanent and the branches are not. We do 
not have to know the edges beforehand as long as we process them one by one. However, we 
need to know the nullity of the graph (number of links in a cospanning tree) so that we know 
when to stop. The algorithm can also be used to update a minimal spanning tree if we add edges 
to the graph or decrease their weight. 
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Kruskal's Algorithm No. 3 

Here we assume the edges are given in descending order by weight. 

1. SetA^E and k 1. 

2. If (V, A — {e k }) is connected, then set A A — {e fc } as well as k «— k + 1 and go to 
step #4. 

3. If (V, A — {e k }) is disconnected, then set k <— k + 1 and go to step #4. 

4. If (V, A) is not a tree, then we go to step #2. Otherwise we stop and output the spanning 
treeT = (V,A). 

T is obviously connected because {V, A) is connected everytime we go to step #4. On the other 
hand, T is circuitless because if the circuit C is in T and the edge c is in the circuit, then c is 
removed from A in step #2 when e k = c (y/ ). Thus, T is a spanning tree of G. In each step, the 
links of the fundamental cut set defined by the branch belonging to T (step #3) are predecessors 
of that branch in the list. Hence, T is cut minimal and it is thus minimal. 

Remark. In each step, the branches and links are permanent. We have to know the edges 
beforehand. On the other hand, we do not have to know their weights as long as we get them 
one by one in descending order. 

Kruskal's Algorithm No. 4 

Here we assume the edges are given in an arbitrary order. 

1. Set A <- E and k <- 1. 

2. If (V, A — {e k }) is connected, then set A A — {ek} as well as k k + 1 and go to 
step #4. 

3. If (V, A — {ek}) is disconnected, then it has two components. The corresponding vertex 
sets form a cut (Vi, V 2 ). We interpret it as an edge set and choose the edge e with the 
lowest weight in (Vi, V 2 ) (if there are more than one, take any). Set A (A — {e k }) U{e} 
as well as k k + 1 and go to step #4. 

4. If k < m, then go to step #2. Otherwise stop and output the spanning tree T = (V, A). 

T is obviously connected because {V, A) is connected everytime we go to step #4. (Take note 
that the connectivity is preserved everytime we go through step #3.) On the other hand, T is 
circuitless. If a circuit C of G ends up in T and c is the edge of C, which is first in the list, 
then c must be removed from A in step #2 when e k = c. (Note that the edge removed from the 
circuit first can not be removed in step #3.) If c comes back later (in step #3), then it forms a 
cut set of (V, A) by itself in which case some other edge of C has been removed. By continuing 
this process, we see that all the edges of C can not be in A in the end (-y/ ). Therefore, T is a 
spanning tree of G. 

In addition, T is cut minimal and minimal because every branch b of T comes in in step 
#3. The links of the fundamental cut set defined by b are either edges of the cut (Vi, V2) which 
is examined at that point or they are links of the cuts we examined later in step #3. Whenever 
an edge of this kind gets removed later in step #3, it is always compensated by edges that are 
heavier in weight than b. Those heavier edges are in the cut (Vi, V 2 ) which is examined at that 
time. Therefore, the weights of the fundamental cut set defined by b are > a(b). 
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Remark. In each step, the branches (e in step #3) are permanent and the links are not. We 
have to know the edges beforehand. We do not have to know the weights beforehand as long as 
we process them one by one. This algorithm can also be used for updating a minimal spanning 
tree if we remove edges from a graph or if we increase the weights of edges. 

Prim's Algorithm 

In Prim's Algorithm (also known as Jarnik's Algorithm), we use the all- vertex incidence ma- 
trix of G. If we label the set of edges incident on vertex v as Q(v), then we can get a list 
Q(vi), . . . , 0,(v n ), i.e. the cuts defined by the vertices (interpreted as edge sets). In addition, we 
assign weights to the vertices. 

The algorithm works in the same way as Dijkstra's Algorithm by constructing the spanning 
tree branch by branch. The variables are A (set of branches of the spanning tree we have at the 
time), B (set of vertices of the spanning tree we have at the time) and I (the cut interpreted as 
an edge set from which we choose the next branch). 

Prim's Algorithm (First Version): 

1. Choose a starting vertex r and set A <— 0, B <— {r} as well as / ^— f2(r). 

2. Choose the lightest edge e from I (if there are more than one, choose any). Take the end 
vertex v of e that is not in B. Set A <- A U {e}, B <- B U {v } as well as I <- I © Q(v) 
and go to step #3. (Remember that © denotes the symmetric difference operation between 
two sets, see page 12.) 

3. If B V, then go to step #2. Otherwise, stop and output the spanning tree T = (B, A) = 

(A). 

Since the edge e was chosen from a cut, T is circuitless. On the other hand, because there is 
a path from r to every other vertex, T has every vertex of G and it is connected. T is thus a 
spanning tree. It is also minimal because 

Statement. During the whole process, (B, A) is a subtree of some minimal spanning tree ofG. 

Proof. We use induction on t, the number of vertices in B. 

Induction Basis : £ = 1. The case is obvious because (B, A) is trivial. 
Induction Hypothesis: The statement is true for £ — k — 1. (k > 2) 
Induction Statement : The statement is true for i = k. 

Induction Statement Proof : In step #2, we can write A = A' U {e}, where eel' and 
B = B' U {v}. (B', A') is a subtree of some minimal spanning tree T min from the induction 
hypothesis. If e belongs to T min , then the case is clear. Otherwise, there is a fundamental circuit 
C in T min + e and there is another edge e' of /' in C (Theorem 2.6). Then, a(e') > a(e) and 
(^min + e) — e' is also a minimal spanning tree and (B, A) is its subtree (because T min is circuit 
minimal and a (e') < a(e)). □ 

Often, we use one or two additional labels for the vertices to make Prim's Algorithm easier. 
In the next version of the algorithm, we will use two labels n(v) and (3(v), which are used to 
perform step #2 more effectively. The values of % are weights (up to oo) and the values of (3 are 
edges (or = 0). Otherwise, the algorithms works in the same way as before. 
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Prim's Algorithm (Second Version): 

1. Choose a starting vertex r and set n(r) <— 0. For every other vertex v, set n(v) <— oo. 
For every vertex v, set (5{v) <— as well as A ^— and £? <— 0. 

2. Choose a vertex u ^ B for which 

7r(u) = min{7r(f )}. 

Set B <- B U If /3(m) ^ 0, then set A <- A U {£(«)}. 

3. Go through all the edges e = (u, v) where v ^ B. If a(e) < n(v), then set 7r(t>) <— cc(e) 
and /3(f) <— e. 

4. If B V, then go to step #2. Otherwise, stop and output the spanning tree T = (B, A) = 

(A). 

5.7 The Lightest Hamiltonian Circuit (Travelling Salesman's 
Problem): The Annealing Algorithm and the Karp-Held 
Heuristics 

Problem. If it is possible, we are to find the Hamiltonian circuit with the lowest weight. A 
Hamiltonian circuit visits all the vertices of a graph. As usual, the weights of the edges have 
been assigned and the weight of a (directed) circuit is the sum of the weights of the edges 
traversed. 

Obviously, we can assume that the graph is nontrivial, connected (otherwise it would not be 
possible to get a Hamiltonian circuit) and simple. If not, then we simply remove all the loops 
and choose the edge with the lowest weight out of the parallel edges. As usual, we denote n as 
the number of vertices, m as the number of edges, V = {vi, . . . , v n } and E = {ei, . . . , e m }. 
We label the weight of an edge e = (v i: vf) as ct(e) = a(v i: vf) and the weight of a Hamiltonian 
circuit H as 'y(H). We agree that the "first" vertex of a Hamiltonian circuit is V\. 

The same problem exists for directed graphs in which case we are looking for the directed 
Hamiltonian circuit with the lowest weight (known as the Unsymmetric Travelling Salesman 's 
Problem). 

The Travelling Salesman's Problem (TSP) 1 is an MV -complete problem, read e.g. MEHL- 
horn for more information. Actually, even deciding the existence of a Hamiltonian circuit is 
an A/'P-complete problem. Solving a small TSP takes a lot of time and larger problems take so 
much time that it is almost impossible to obtain accurate solutions. Therefore, many stochastic 
and approximation methods are used in practice. Then, we have to accept the possibility of 
inaccurate outcomes or even the lack of results. 



'The name "Travelling Salesman's Problem" comes from an interpretation where the vertices of a graph are 
cities and the weights of the edges between the cities are travelling times. The salesman needs to visit every city 
in the shortest amount of time. 
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The Annealing Algorithm 

The annealing algorithms or thermodynamic algorithms have the following common features: 

(A) The system in question is always in some state s. The set of all states S is finite and 
known. In the TSP, a state is a Hamiltonian circuit. 

(B) Each state s has a response f(s), which can be calculated in a timely fashion from the 
state. Our goal is to find a state whose response is near the minimum/maximum value. 
The response of a state of a TSP is the weight of a Hamiltonian circuit. 

(C) There is a procedure A k which is used to move from state s to state A k (s). k is a parameter 
of the procedure which belongs to the set K. K can change during the procedure. The 
purpose is to move to certain states "near" the state s which are defined by the parameter 
k. By repeating the procedure with proper values of k, we should be able to move from 
any state to any other state. (In some cases, we can omit this last part.) 

(D) Every time we move from one state to another, we should be able to choose the param- 
eter k quickly and randomly from K. In particular, the set K itself should be easily 
computable. 

(E) We should be able to quickly perform the procedure A k given a value of k. 

(F) We should be able to find a starting state s . For the TSP, the starting state is a Hamiltonian 
circuit. 

The algorithm is as follows: 
The Annealing Algorithm: 

1. Choose the starting state s , the initial temperature T and set s <— s as well as T ^— T . 

2. When we are in the state s, we randomly choose a parameter k e K and compute s' = 

A k {s). 

3. If f(s') < f(s), then set s <— s' and go to step #5. 

f( s )~ f( s ') 

4. If f(s f ) > f(s) 9 then generate a random number r in the interval [0,1). If r < e t 9 

f(s)—f(s / ) 

then set s <— s . Thus, we accept a "worse" state with probability e t . Note that the 
greater the temperature T, the greater the probability that we go "uphill". 

5. If we have gone through a maximum total number of iterations, then we stop and output 
s. Otherwise, if we have gone through sufficiently many iterations of the procedure using 
temperature T, then we lower T by some rule and go to step #2. 

fja)-f(s') 

Remark. The distribution of the probability p s > = e t used in step #4 is (apart from 
normalizing) a so-called maximum entropy distribution with the following condition on the 
expected value: 

s'=A k (s) 
keK 

/(«')>/(») 

where p, depends on T and s. The distribution is also called a Boltzman distribution and it is 
analogous to the distribution of the same name in Statistical Mechanics. Refer to courses in 
Physics and Information Theory for more information. 
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At first, we wait until the fluctuation in the states settles to a certain equilibrium (using the 
response /(«)). After that, we lower the value of T a bit and wait again for the equilibrium. 
Then, we lower T again and so on. We continue this until the change in values of f(s) is 
sufficiently small or if we have ran out of time. 

The operation A k and the set K of the neighboring states depend on the problem. The state 
structure and the response function also depend on the problem. For the TSP, we still need to 
assign A k and K for every situation. For this purpose, we take another parameter j and set 
j <— 2. In step #2, we update j in the following way: 




j + 1 if j < n 
2 otherwise. 



(Another way of choosing j in step #2 would be to choose it randomly out of {2, . . . , n}.) 
Furthermore, we choose 

K = {2,...,n}-{j}. 
A k is defined by the following operation (known as the reversal): 

• If k > j, then we reverse the order of the vertices v ij: . . . ,v ik on the corresponding 
subpath in the current Hamiltonian circuit 

s : v 1 ,v i2 ,. . . ,v in ,v 1 . 

• If k < j, then we reverse the order of the vertices v ik) . . . ) v ij on the corresponding 
subpath in the current Hamiltonian circuit 

s : v 1 ,v i2 ,. . . ,v in ,v 1 . 

We add the missing edges to the graph with very large weights so that we get a complete 
graph and we will not have to worry about the existence of a Hamiltonian circuit in the first 
place. If we still do not get a Hamiltonian circuit in the end without those added edges, then 
there is not a Hamiltonian circuit. 

The starting temperature T should be much larger than the values of \f(s') — f(s)\ which 
guarantees that we can in principle move to any state ("annealing") in the earlier stages of the 
algorithm. After that, we lower the temperature applying some rule, for example a 10% change. 

The annealing algorithm also works for the unsymmetric TSP with obvious changes. 

Karp-Held Heuristics 

In the Karp-Held Heuristics, we do not directly look for a Hamiltonian circuit but look for a 
similar subgraph, known as a spanning 1-tree 2 . The process does not work for the unsymmetric 
TSP. The spanning 1-tree S v corresponding to the vertex v (known as the reference vertex) is a 
subgraph of G that satisfies the following conditions: 

(a) S v is connected and contains every vertex of G. 

(b) S v contains exactly one circuit C and the vertex v belongs to C. 

(c) S v has exactly two edges incident on v. 



2 Not to be confused with the 1-tree on p. 23! 
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Clearly, a Hamiltonian circuit is a spanning 1-tree corresponding to any of the vertices. The 
weight of the spanning 1-tree S v is the sum of of the weights of all its edges, denoted 7(5'„). S v 
is minimal if it has the lowest possible weight. 

Statement. S v is minimal if and only if 

(i) S v — v is a minimal spanning tree ofG — v, and 

(ii) the two edges of S v incident on v are the two lightest edges of G out of all the edges 
incident on v. 

Proof. Let S v be a minimal spanning 1-tree. Let e and e' be the two edges in S v incident on 
v. Then, S v — v is a spanning tree of G — v because removing v destroys the circuit but the 
connections remain unsevered. If S v — v is not a minimal spanning tree of G — v then there is 
a lighter spanning tree T of G — v . By adding the vertex v and the edges e and e' to T, we get a 
spanning 1-tree corresponding to vertex v which is lighter than S v (^/ ). Therefore, (i) is true. 
Obviously, (ii) is true (because otherwise we would get a lighter spanning 1-tree by replacing e 
and e' with the two lightest edges in G incident on v). 

Let us assume that (i) and (ii) are true. If S v is not minimal, then there is a lighter minimal 
spanning 1-tree S' v corresponding to v. Because S' v also satisfies (ii), the two edges incident on 
v are the same (or at least they have the same weight) in S v and S' v . Thus, S' v — v is lighter than 

S v -V(y/). " " □ 

It follows from the statement that any algorithm that finds the minimum spanning tree also 
works for finding the minimum spanning 1-tree with minor modifications. Especially, Kruskal's 
and Prim's Algorithms are applicable. 

In the Karp-Held Heuristics, we also use weights of vertices, denoted j3(v). With these, we 
can define the virtual weight of an edge as 

a'(vi, Vj) = a(v h Vj) + f3(vi) + f3(vj). 

With the concept of virtual weights, we get the virtual weight of a spanning 1-tree S v (we label 
the edge set of S v as A): 

(vi,vj)eA (vi,vj)eA (vi,Vj)eA 

(vi,vj)eA 

Now we denote the degree of the vertex u in S v as d Sv (u). Then, 

n 

X)o?(«i)+^)) = E^) d «.( w *) 

(vi,Vj)eA i=l 

and 

n 

i{S v )=r,{S v )+Y t fi{v i )d 8 ,{v i ). 

i=l 

In particular, if we have a Hamiltonian circuit H (a special spanning 1-tree), then 



d H (vi) = ■■■ = d H (v n ) = 2 
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and 

n 

1 '{H)= 1 {H) + 2Y,P{v l ). 

i=i 

N V ' 

Does not depend on H\ 

Minimization of the Hamiltonian circuits using virtual weights yields the same minimal circuit 
than obtained by using real weights. In general though, if we use virtual weights to search for 
spanning 1-trees, then we get results different from the spanning 1-trees obtained by using real 
weights. 

From now on, we only consider the spanning 1-tree corresponding to vertex v 1 and we leave 
out the subscript. This is not a limitation of any kind on the Hamiltonian circuits although it 
might be a good idea to change the reference vertex every now and then. We assume that H min 
is a minimal Hamiltonian circuit and S' is the minimal spanning 1-tree obtained from using 
virtual weights (which of course corresponds to v{). Then, 

In addition, 

n 

7'(#min) =7(#min)+2j]/3«) 

i=l 

and 

n 

1 '(S')= 1 (S') + J2^)ds'(v z ). 
i=\ 

Thus, 

n n 

7(#min) = Y(ifmin) - 2 ]T f3( Vl ) > 7 '(S') - 2 ]T f3(v,) 

i=l i=l 

n 

= l(S') + J2(3(v l )(ds>(v t )-2), 
i=\ 

from which we get a lower limit on ^(H min ). 

The idea of the Karp-Held Heuristics is to guide the degrees of the vertices in S' to the value 
2 by changing the weights of the vertices. If we succeed, then we get a minimal Hamiltonian 
circuit. In all cases, we get a lower limit on the weights 7(H) of the (possible) Hamiltonian 
circuits by using the calculation above. (Note that ^5/(^1) is always = 2 if S' is the spanning 
1-tree corresponding to vi.) 

The Karp-Held Heuristics: 

1. Set j3(v) ^—0 for every vertex v. 

2. Set a'(u, v) <— a(u, v) + f3(u) + (5(v) for each edge (u, v). 

3. Find the minimal spanning 1-tree S' using virtual weights a'(u, v). If we fail to find this 
kind of spanning 1-tree, then there is no Hamiltonian circuit and we can stop. 

4. If S' is a circuit, then output the minimal Hamiltonian circuit H — S' and stop. 

5. If S' is not a circuit and the lower limit calculated from S' increased during the last K 
iterations, then set f3(v) (3(v) + ds'(v) — 2 for every vertex v and go to step #2. (K is 
a fixed upper bound on the number of iterations.) 
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6. If the lower limit calculated from S' has not increased during the last K iterations, then 
output that lower limit and stop. 

This procedure does not always produce a minimal Hamiltonian circuit even if there exists 
one. In practice, it often produces either a minimal Hamiltonian circuit or a good lower limit on 
the weight of it. Getting a number for the lower limit does not, however, guarantee the existence 
of a Hamiltonian circuit in the graph! 

Karp-Held Heuristics has many steps where we have to choose between different options 
(such as the reference vertex and the spanning 1-tree). We can not go through every possibility 
so we must choose randomly. Then, we have a Las Vegas algorithm or stochastic algorithm. 



5.8 Maximum Matching in Bipartite Graphs: The Hungar- 
ian Algorithm 



A matching in the graph G = (V, E) is a set of edges S C E none of which are adjacent to each 
other. A matching is a maximum matching if it has the greatest possible number of edges. The 
end vertex of an edge in a matching is matched. 

Problem. We want to find the maximum matching in a bipartite graph. 

An alternating path of a matching S is a path that satisfies the following conditions: 

(1) The first vertex on the path is not matched, and 

(2) every second edge is in the matching and the remaining edges are not in the matching. 

Note that the first edge in an alternating path is not in the matching. In addition, if the last vertex 
of an alternating path is not matched, then this path is an augmenting path of S. A matching 
without augmenting paths is called a maximal matching. 

Example. For the bipartite graph 



one augmenting path of the matching S = {(vi,w 3 ), (f 3 ,w 2 ), (v4,w 6 ), (v 5 ,w 5 )} is the path 
where the vertices are t>2, W2, t>3, Wq, V4, W\. 





6 
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We can augment sl matching S using its augmenting path p as follows: 

1. We remove the edges of S in p, and 

2. We add the edges in p which are not in S. 

The new edge set is obviously a matching. Note that the number of edges in S on an augmenting 
path is one fewer than the number of the remaining edges. Therefore, the number of edges in 
a matching increases by one after the augmenting operation. It is not possible to augment a 
maximal matching. 

Example. (Continuing from the previous example) By using the given augmenting path from 
the matching S, we get a new maximal matching Si = {(vi,w 3 ), (v 2 ,W2), (v 3 ,w 6 ), (i>4, itfi), 




In the Hungarian Algorithm, we systematically search for augmenting paths until we get 
a maximal matching. After that, it suffices to prove that a maximal matching is a maximum 
matching. From now on, we only consider bipartite graphs because the algorithm is then much 
simpler. We search for augmenting paths by constructing an alternating tree of a matching S 
which is a subtree of G such that 

(1) a vertex r (the root of the tree) is unmatched, 

(2) every second edge on each path out from r is in S and the remaining edges are not in S, 
and 

(3) either there is an augmenting path out from r or we can not add any more edges to S. 

An alternating tree is an augmenting tree if it has an augmenting path. Otherwise, it is a Hun- 
garian tree. Every augmenting path is obviously an augmenting tree by itself. Note that the 
only unmatched vertex of a Hungarian tree is the root. 

Example. ( Continuing from the previous example) Two alternating trees of the matching S are 
( the root is circled): 
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Both of them are augmenting trees. An alternating tree of the matching S\ (the root is W4) is the 
Hungarian tree 




Augmenting and Hungarian trees are not unique. We can have many different trees depend- 
ing on the order we take the edges for constructing the trees even though the roots are the same. 
On the other hand, 

Statement. If a matching in a bipartite graph G has a Hungarian tree, then it does not have an 
augmenting tree with the same root. 

Proof. Let us prove by contradiction and consider the counter hypothesis: A matching S has a 
Hungarian tree U and an augmenting tree T with the same root r. We get an augmenting path 
in the augmenting tree 

p:r = v ,e 1 ,v 1 ,e 2 , . . -,e k ,v k . 

We choose the last vertex v,i which is in U from the path p (at least r = v is in U). Since 
v k is not in U, i < k. Furthermore, the edge e i+ i is not in the matching nor in U (otherwise 
v i+ i would also be in U). On the other hand, since e i+ i is not in U, v i+ i has to be an end 
vertex of another edge in U (s/ ) because the only reason why the edge ej + i is not put into U 
while constructing U is that the other end vertex t> i+1 of e i+ i is already in U . Note how the 
bipartiteness of the G comes in: If the cut in G that results in the bipartition is (Vi, V 2 ), then the 
vertices of U and p alternate between V\ and V 2 . Therefore, the length of the r-Vi path is even 
in p and U. □ 

Constructing an alternating tree from a root always leads to a Hungarian tree or an augmenting 
tree but not both. The order of the edges taken does not matter. (This is not the case for general 
graphs.) 

For the bipartite graph G = (V, E), the Hungarian Algorithm is as follows. The cut that 
yields the bipartition is (Vi, V2). 

The Hungarian Algorithm: 

1. Set 5^—0. (We can also use some other initial matching.) 

2. If every vertex in V\ or in V 2 is matched in S, then S is a maximum matching and we 
stop. 

3. If there are unmatched vertices in S of V\, then go through them in some order construct- 
ing alternating trees (the method of construction is not important as we claimed). If there 
is an augmenting tree, then augmenting the matching S by using the augmenting path we 
have another matching Si. Set S ^— Si and go to #2. 

4. If all the alternating trees that have unmatched vertices in Vi as roots are Hungarian, S is 
maximal and we stop. 
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Theorem 5.3. A maximal matching in a bipartite graph is a maximum matching. 

Proof. Let us prove by contradiction and consider the counter hypothesis: A maximal matching 
S in the bipartite graph G = (V, E) is not a maximum matching. Then, there are more edges in 
the maximum matching S mSLX than in S and in V\ there are more vertices matched in S max than 
in S. We choose an arbitrary vertex v e V\, which is matched in S max but not in S. Then, we 
have a path 

p : v = v , ei, vi, e 2 , . . . , e fc , w fe = tu, 

whose edges are alternating between S max and S, i.e. ei is in S max and e 2 is in S and so on. 
We choose the longest such path p. Because p is obviously an alternating path of S, it has 
even length, i.e. e k is an edge of S. (Otherwise, p would be an augmenting path of S which is 
impossible because S is maximal.) Thus, w is matched in S but not matched in S max (because 
the path p can not be continued). 

Hence, every vertex v e V\ which is matched in S max but not in S corresponds to a vertex 
w G Vi, which is matched in S but not in S max . Now, every path that ends at w must start 
from the vertex v if the starting vertex is matched in S max but not in S. The last edge of such a 
path has to be e k (the only edge in S incident on w) and the second to the last vertex has to be 
Vk-i. Furthermore, the second to the last edge of this path has to be e k -i (the only edge of S max 
incident on Vk-i) and the third to the last vertex has to be v k -2, and so on. 

However, there are then in Vi at least as many vertices w that are matched in S but not in 
'S'max as there are vertices v that are matched in S max but not in S (y/ ). □ 

Corollary. The Hungarian algorithm produces a maximum matching in a bipartite graph. 

A matching is perfect if it matches every vertex of a graph. Thus, a graph with an odd 
number of vertices can not have a perfect matching. Let us consider the graph G = (V, E) and 
denote v{v) = {adjacent vertices of v} as well as v(A) = [j veA v(v) for the vertex set A C V. 
Let us denote by the number of elements in the set X (the cardinality of the set). With 

these notions, we can present the following famous characterization: 

Theorem 5.4. (Hall's Theorem or "Marriage Theorem") A bipartite graph G whose bipar- 
tition cut is (Vi, V2) has a perfect matching if and only if every vertex set A C V\ and B C V2 
satisfies the conditions # (A) < #(i/(A)) and #(B) < #(i/(B)). 

Proof. If a perfect matching exists, then obviously < #(v(A)) and #(5) < #(v(B)) 

hold for all sets of vertices A C V 1 and B C V 2 . (Otherwise, we can not find a pair for every 
vertex in A or B in the matching.) 

Let us assume that for all sets of vertices A C V\ and B C V 2 , #(A) < #(y{A)) and 
< #(^(-3))- Let S be a maximum matching in G. We will prove by contradiction 
and consider the counter hypothesis: S is not perfect. We choose a vertex v which is not 
matched in S. Let us examine the case where v e Vi (the other case where v G V 2 is obviously 
symmetrical). The contradiction is apparent if v is an isolated vertex so we can move to the case 
where v is an end vertex of an edge. The alternating tree with the root v is then nontrivial and 
since the matching is also maximal, this tree is Hungarian. We choose such a Hungarian tree U. 
We label the set of vertices of V\ (resp. V 2 ) in U by A (resp. by B). Because of the construction 
ofU,B — v(A). On the other hand, the vertices of A and B in U are pairwise matched by the 
edges of S, except for the root r. Hence, = #(5) + 1 > #(5) (y/ ). □ 



CHAPTER 5. GRAPH ALGORITHMS 



80 



5.9 Maximum Flow in a Transport Network: The Ford-Ful- 
kerson Algorithm 

A transport network is a directed graph G = (V, E) with weighted arcs that satisfies the fol- 
lowing: 

(1) G is connected and loopless, 

(2) G has only one source s, 

(3) G has only one sink t, and 

(4) the weight c(e) of the arc e is called the capacity and it is a nonnegative real number, i.e. 
we have a mapping c : E — > R - 

(Compare to stationary linear networks in Section 4.4.) Actually, we could assume that G has 
every possible arc except loops and it can even have multiple parallel arcs. If this is not the case, 
then we simply add the missing arcs with capacity zero. Naturally, we can also assume that G 
is nontrivial. 

A flow f of a transport network is a weight mapping E — > M , which satisfies: 

(i) For each arc e, we have the capacity constraint /(e) < c(e), and 

(ii) each vertex v ^ s,t satisfies the conservation condition (also called Kirchhoff's Flow 
Law, compare to Section 4.4) 

E /( e ) = E 

initial vertex terminal vertex 
of e is v of e is v 

/(e) is called the flow of e. The flow of the arc (u, v) is also denoted as f(u, v). The value of 
the flow / is 

m=E/( e )- 

initial vertex 
of e is s 

A flow /* is a maximum flow if its value is the largest possible, i.e. |/*| > |/| for every other 
flow /. 

An s-t cut of a transport network S is a (directed) cut I — (Vi, V 2 ) such that s is in V\ and 
t is in V 2 . The capacity of such a cut is 

c(J) = E c(u,v). 

vev~2 

(Note that the arcs in the direction opposite to the cut do not affect the capacity.) The capacity of 
the cut (Vi, V 2 ) is also denoted as c(Vi, V 2 ). Furthermore, we define the flux of the cut I = (Vi, 
V>) as 

ugVi 

and the counter-flux as 

r(/) = E^)- 

U6V2 

The value of a flow can now be obtained from the fluxes of any s-t cut: 
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Theorem 5.5. Iff is a flow of a transport network and I is an s-t cut, then 

|/| =/+(/) -/"(/). 

Proof. Obviously, 



initial vertex terminal vertex 
of e is v of e is v 



' v 7^ s,t. 



We denote I — (Vi, V 2 ). By going through the vertices v in V\ and by adding up the equations 
we get 

v£Vi initial vertex vdV\ terminal vertex 

of e is v of e is v 

For each arc e whose end vertices are both in V\, /(e) and —/(e) are added exactly once and 
thus they cancel out. Therefore, 

= 1/1- □ 

vev 2 vev-i 

Corollary. Iff is a flow of a transport network and I is an s-t cut, then \ f\ < c(I). 

Proof. |/| = /+(/) - /"(/) < /+(/) < c(J). □ 

An arc e of a transport network is saturated if /(e) = c(e). Otherwise, it is unsaturated. 
Now, we point out that |/| = c(Vi, V 2 ) if and only if 

(i) the arc (u, v) is saturated whenever u e Vi and f G V 2 , and 

(ii) /(-u, f ) = whenever -u G V 2 and i; G V\. 

An s-t cut /* of a transport network is called a minimum cut if c(/*) < c(J) for every other 
s-t cut /. 

Corollary. If f is a flow of a transport network, I is an s-t cut and \ f\ = c(I), then f is a 
maximum flow and I is a minimum cut. 

Proof. If /* is a maximum flow and I* is a minimum cut, then |/*| < c(J*) by the corollary 
above. Thus, 

i/i < in < c(n < c(i) 

and / is indeed a maximum flow and I is indeed a minimum cut. □ 

Actually, the value of the maximum flow is the capacity of the minimum cut. The show this, we 
examine a path from vertex s to vertex v (not necessarily a directed path): 

s = v ,ei,vi,e 2 , ...,e k ,v k = v (pathp). 

If t{ = (vi-i,Vi), then the arc e^ is a forward arc. If e-i = (vi, v j_i), then the arc e^ is a back arc. 
The arc e { of p is now weighted by the following formula: 



e ( e i) 



c(ej) — /(ej) if e« is a forward arc 
/(ej) if ej is a back arc 
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and the path p is weighted by the following formula: 

k 



e{p) = min{e(ej)}. 

i=l 



The path p is unsaturated if e(p) > 0, i.e. all of the forward arcs of p are unsaturated and 
f(ei) > for all the back arcs e, of p. 

In particular, an s-t path can be unsaturated in which case it is called an augmenting path 3 . 
All of these definitions are of course attached to a certain flow /. By starting from an s-t path 
p (and a flow /), we can define a new flow: 



/ is really a flow. Changes in / can only occur at the arcs and vertices of p. Every arc of 
p satisfies the capacity constraint because of how e(p) and /(e) are defined. A vertex vi of p 
satisfies the conservation condition which we can verify. We have four cases: 




Obviously (think about the source s) 



so / is not a maximum flow if it has an augmenting path. Moreover, the converse is true as well. 



Theorem 5.6. A flow is a maximum flow if and only if it does not have any augmenting path. 

Proof. As we claimed, a maximum flow can not have an augmenting path. Let us assume that 
a flow / does not have an augmenting path. We denote the set of vertices which we can reach 
from the source s along unsaturated paths by V\. Then, trivially, s E V\ and t ^ V\ (because 
there are no augmenting paths). Thus, the cut / = (VI, V?) is an s-t cut. We proceed to prove 
that |/| = c(J). By the previous corollary, / is then a maximum flow. 

Let us consider the arc (u, v), where u G V\ and v E V 2 . Then, there exists an unsaturated 
s-u path p. The edge (it, v) is saturated because there would be an unsaturated s-v path oth- 
erwise. Similarly, we conclude that f(u,v) = for every arc (u,v), where u E V 2 and 
v E V\. Therefore, the flux f + (I) is c(J) and the counter- flux f~(I) is zero. By Theorem 




/(e) + e(p) if e is a forward arc of p 
/(e) — e(p) if e is a back arc of p 
/(e) otherwise. 



\f\ = \f\+<p) 



Hence, 



5.5, |/| = c(J). 



□ 



3 Not to be confused with the augmenting path in the previous section! 
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We have also proven the celebrated 

Theorem 5.7. (Max-Flow Min-Cut Theorem) The value of a maximum flow in a transport 
network is the same as the capacity of a minimum cut. 

If the capacities of the arcs are rational numbers, then a maximum flow can be found by 
using Theorem 5.6. The algorithm tries to find an augmenting path for /. If it can not be found, 
then we have a maximum flow. If we find an augmenting path, then we use it to create a greater 
flow /. In the algorithm, we use a label a for the vertices in the following way: 

a(v) = (u, direction, A), 

where u is a vertex in the transport network (or — if it is not defined), "direction" is either 
forward (— >) or back (■<— ) (or — if it is not defined) and A is a nonnegative real number (or 
oo). The point is, whenever a vertex v is labeled, there is an s-v path p which contains the 
("directed") arc (u, v) and A = e(p). A direction is forward if an arc is in the direction of the 
path and back otherwise. We can label a vertex v when the vertex u has been labeled and either 
(u, v) or (v , u) is an arc. We have two cases: 

(1) {Forward Label) If e = (w, v) is an arc and a(u) = (•, •, A u ) as well as c(e) > /(e), then 
we can write a(v ) = (u, — >, A v ), where 

A„ = min{A u ,c(e) - /(e)}. 

(2) (Back Label) If e = (v, u) is an arc and a(u) = (•, •, A„) as well as /(e) > 0, then we 
can write ct(v) = (u, <— , A„), where 

A„ = min{A u ,/(e)}. 

There are two phases in the algorithm. In the first phase, we label the vertices as presented 
above and each vertex is labeled at most once. The phase ends when the sink t gets labeled as 
a(t) = (•, — >, A t ), or when we can not label any more vertices. In the second case, there are no 
augmenting paths and the flow we obtain is a maximum flow so we stop. In the first case, the 
flow we obtain is not a maximum flow and we have an augmenting path p for which e(p) = A t . 
The algorithm moves on to the second phase. In the second phase, we construct a new greater 
flow / by using the labels of the vertices of p obtained previously. After this, we go back to the 
first phase with this greater new flow. 

The Ford-Fulkerson Algorithm: 

1 . Choose an initial flow / . If we do not have a specific flow in mind, we may use / (e) = 0. 
Label the source s by ct(s) ^— (— , — , oo). Set / ^— / . 

2. If we have a unlabeled vertex v, which can be labeled either forward by (w, — >, A„) or 
backward by (w, , A„), then we choose one such vertex and label it. (There can be 
many ways of doing this and all of them are permitted.) If such a vertex v does not exist, 
output the maximum flow / and stop. 

3. If t has not been labeled, go to step #2. Otherwise, set u <— t. 
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4. If a(u) = (w, — >, A u ), then set 

f(w,u)-^f(w,u) + A t and u-^w. 
If a(u) = (w, <—, A u ), then set 

f(u,w)^-f(u,w) — A t and u^w. 

5. lfu = s, then remove all the labels a but not the label of the source and go to step #2. If 
u ^ s, then go to step #4. 

If fo(e) and c(e) are rational numbers, then the algorithm stops and produces a maximum flow. 4 
In this case, we can assume that these weights and capacities are nonnegative integers. Thus, 
the value of a flow increases by a positive integer every time we move from the second phase to 
the first phase and the value reaches a maximum eventually. On the other hand, the number of 
steps can be as large as the value of the maximum flow. The performance time of the algorithm 
does not only depend on the number of vertices but also the capacities. 

The algorithm can be modified 5 so that it does not depend on the capacities. Thus, it will 
work for irrational capacities. In this case, our purpose during the labeling phase is to find the 
shortest augmenting path. We get this by always choosing the vertex v in step #2 in such a way 
that in a(v ) = (w, A„), w received its label as early as possible. 

The Ford-Fulkerson Algorithm also works for finding a maximum matching in a bipartite 
graph. Let us do an example: 

Example. Using the bipartite graph G from an example in the previous section, we get a trans- 
port network G': 




Every edge of G' is directed from left to right and given a capacity ofl. The initial flow is a 
zero flow (or a greater flow we obtain from some other initial flow). During the whole process, 
the flows of the edges are integers or 1. We take into the matching those edges in G whose 
corresponding edges e in G' receive a flow /(e) = 1 and a maximum flow gives a maximum 
matching. Note that an augmenting path can be of length larger than three in this case. (We 
can also claim now that the augmenting paths here and the augmenting paths obtained from the 
Hungarian Algorithm do have something in common after all! ) 



4 If there are irrational capacities or flows /o(e), then the algorithm may not stop at all and it may not produce a 
maximum flow even if the process repeats endlessly. Of course, we do not have to use irrational flows. In practice, 
we will not use irrational capacities. 

5 This is known as the Edmonds-Karp Modification (refer e.g. to Swamy & THULASIRAMAN). 



Chapter 6 
Drawing Graphs 



6.1 Planarity and Planar Embedding 

We have not treated graphs as geometric objects so far in the course. In practice, we draw 
graphs, i.e. we treat vertices as geometric points and edges as continuous curves. If a graph 
G can be drawn on a plane (or a sphere) so that the edges only intersect at vertices, then it is 
planar. Such a drawing of a planar graph is a planar embedding of the graph. 

A connected part of a plane which does not contain any vertices and is surrounded by edges 
is called a region of a planar embedding. In addition, the part outside the embedding is consid- 
ered as a region, known as the exterior region (when we draw a planar graph on a plane or on a 
sphere, it is just like any other region). The vertices surrounding a region s are called boundary 
vertices and the edges surrounding s are called boundary edges. Two regions are adjacent if 
they share a boundary edge. Note that a region can be adjacent to itself. 

Example. In the following planar embedding 




the regions are s\, s 2 , S3, S4 and S5 (the exterior region) and their boundary vertices and edges 
as well as their adjacent regions are given in the table below: 



region boundary vertices boundary edges adjacent regions 







Vl,V 5 ,V 2 






ei,ei ,e 2 




S2, S5 


«2 




V5,V4,V3,V 6: 




e 2 , 


e 4 ,e 7 ,e 9 ,e 8 , 


e 6 


s l, S2, S3, S5 


S 3 










e 4 ,e 5 




S2, S5 


S 4 




V 5 






e 3 




S5 


S5 






V8 


eio 


, e3,e 5 ,e 7 ,e 6 . 




Sl,S 2 ,S 3 ,S 4 
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In the following, we investigate some fundamental properties of planar embeddings of 
graphs. 

Theorem 6.1. (Euler's Polyhedron Formula 1 ) If a planar embedding of a connected graph G 
has n vertices, m edges and f regions, then 

f + n = m + 2. 

Proof. Let us use induction on m. 

Induction Basis : m — 0. Planar embedding of G has only one vertex and one region (the 
exterior region) so the claim is true. 

Induction Hypothesis: The theorem is true for m <£.(£> 0) 

Induction Statement : The theorem is true for m — t + 1. 

Induction Statement Proof : We choose an edge e of G and examine the graph G' = G — e. 
If e is in a circuit, then G' is connected and by the Induction Hypothesis, we get 

/' + n = (m - 1) + 2, 

where /' is the number of regions in G' . However, closing the circuit with e increases the 
number of regions by one so /' = / — 1 and the theorem is true. If G — e is disconnected, 
then it has two planar components, G 1 and G 2 whose number of vertices, edges and regions are 
rii, n 2 , mi, m 2 , fi and f 2 , respectively. By the Induction Hypothesis, 

fi + ni — mi + 2 and f 2 + n 2 = m 2 + 2. 

While adding e, the number of regions becomes fi + f 2 — 1 (Gi and G 2 share the same exterior 
region or one exterior region is drawn to be a region of the other component), the number of 
vertices becomes n\ + n 2 and the number of edges becomes mi + m 2 + 1. Hence, the claim is 
true. □ 

Example. (Continuing from the previous example) We remove the vertex v$ to get a connected 
planar embedding. Now, we have 7 vertices, 10 edges, 5 regions and 5 + 7= 10 + 2. 

Theorem 6.2. (The Linear Bound) If a simple connected planar graph G has n > 3 vertices 
and m edges, then 

m < 3n — 6. 

Proof. If the regions of a planar embedding of G are «i, . . . , Sf, then we denote the number of 
boundary edges of s, by r; (i = 1, . . . , /). The case / = 1 is obvious because G is then a tree 
and m = n— l<3n — 6. Thus, we assume that / > 2. Since G is simple, every region has at 
least 3 boundary edges and thus 

/ 

i=i 

Every edge is a boundary edge of one or two regions in the planar embedding, so 

/ 

Ti < 2m. 

i=i 

The result now follows directly from Euler's Polyhedron Formula. □ 
'The name comes from a polyhedron with n vertices, m edges, / faces and no holes. 
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Theorem 6.3. (The Minimum Degree Bound) For a simple planar graph G, 5(G) < 5. 

Proof. Let us prove by contradiction and consider the counter hypothesis: G is a simple planar 
graph and 5(G) > 6. Then, (by Theorem 1.1) m > 3n, where n is the number of vertices and 
m is the number of edges in G. {y/ Theorem 6.2) □ 

A characterization of planar graphs is obtained by examining certain forbidden subgraphs. 

Theorem 6.4. (Kuratowski's Theorem) A graph is planar if and only if none of its subgraphs 
can be transformed to K 5 or K 3 3 by contracting edges. 

The proof is quite complicated (but elegant!), refer e.g. to Swamy & Thulasiraman for 
more information. K 5 and K 3 3 are not planar, which can be verified easily. 

There are many fast but complicated algorithms for testing planarity and drawing planar 
embeddings. For example, the Hopcroft-Tarj an Algorithm 2 is one. We present a slower classi- 
cal polynomial time algorithm, the Demoucron-Malgrange-Pertuiset Algorithm 3 (usually just 
called Demoucron's Algorithm). The idea of the algorithm is to try to draw a graph on a plane 
piece by piece. If this fails, then the graph is not planar. 

If G is a graph and R is a planar embedding of a planar subgraph S of G, then an R-piece 
P of G is 

• either an edge of G — S whose end vertices are in S, or 

• a component of the subgraph induced by vertices not in S which contains the edges (if 
any) that connect S to the component, known as pending edges, and their end vertices. 

Those vertices of an i?-piece of G that are end vertices of pending edges connecting them to 
S are called contact vertices. We say that a planar embedding R of the planar subgraph S is 
planar extendable to G if R can be extended to a planar embedding of the whole G by drawing 
more vertices and/or edges. Such an extended embedding is called a planar extension of R to 
G. We say further that an .R-piece P of G is drawable in a region s of R if there is a planar 
extension of it! to G where P is inside s. Obviously all contact vertices of P must then be 
boundary vertices of s, but this is of course not sufficient to guarantee planar extendability of R 
to G. Therefore we say that a P is potentially drawable in s if its contact vertices are boundary 
vertices of s. In particular, a piece with no contact vertices is potentially drawable in any region 
of R. 

Demoucron's Algorithm: 

1. We first check whether or not G is a forest. If it is a forest, then it clearly is planar and 
can be planar embedded. (There are fast algorithms for this purpose.) We can then stop. 

2. If G is not a forest then it must contain at least one circuit. We choose a circuit C, embed 
it to get the planar embedding D, and set R <— D. (A circuit is obviously planar and is 
easily planar embedded.) 

3. If R is a planar embedding of G, then we output it and stop. 

2 The original reference is HOPCROFT, J.E. & Tarjan, R.E. : Efficient Planarity Testing. Journal of the ACM 
21 (1974), 549-568. 

3 The original reference is DEMOUCRON, G. & MALGRANGE, Y. & PertuisET, R.: Graphes planaires: recon- 
naissance et construction des representations planaires topologiques. Revue Frangaise Recherche Operationnelle 
8 (1964), 33^7. 
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4. We construct the set V of all i?-pieces of G. For each piece P G V we denote by S(P) 
the set of all those regions of R which P is potentially drawable in. 

5. If, for an i?-piece P e V, the set S(P) is empty then G is not planar. We can then output 
this information and stop. 

6. Choose an i?-piece P, starting from those potentially drawable only in one region. 

7. Depending on the number of contact vertices of P, we planar extend R: 

7.1 If P has no contact vertices, we call Demoucron's Algorithm recursively with input 
P. If it turns out that P is not planar, then G is not planar, and we output this 
information and stop. Otherwise we extend R to a planar embedding U by drawing 
P in one of its regions, set R 4- U, and return to step #3. 

7.2 If P has exactly one contact vertex v, with the corresponding pendant edge e, we 
call Demoucron's Algorithm recursively with input P. If it turns out that P is not 
planar, then G is not planar, and we output this information and stop. Otherwise we 
extend R to a planar embedding U by drawing P in a region with boundary vertex v, 
set R 4— U, and return to step #3. (This region of R will then be an exterior region 
of the planar embedding of P.) 

7.3 If P has (at least) two contact vertices v\ and v 2 , they are connected by a path p in 
P. We then extend R to a planar embedding U by drawing p in a region of R with 
boundary vertices v\ and t> 2 where P is potentially drawable, set R 4— U, and return 
to step #3. 

Clearly, if G is not planar, Demoucron's Algorithm will output this information. On the 
other hand, the algorithm will not get stuck without drawing the planar embedding if the input 
is planar, because 

Statement. If G is planar, then at each step of the algorithm R is planar extendable to G. 

Proof. We use induction on the number of times £ the algorithm visits step #7. 

Induction Basis : £ = 0. Now either G is a forest (and R is not needed) or R is a circuit of 
G. Obviously the planar embedding of this circuit can be planar extended to G. 

Induction Hypothesis: The statement is true for £ < r. (r > 0) 

Induction Statement : The statement is true for £ = r + 1. 

Incuction Statement Proof : For step #7.1 the matter is clear. If, in step #7.2, P is potentially 
drawable in the region s of R, it can always be drawn in this region without endangering subse- 
quent steps. In other words, any possible region can be chosen. This is because the region can 
be exchanged for another at all times by "reflection" with respect to the vertex v (and possibly 
rescaling): 
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Similarly, if in step #7.3, P is drawable in a region of R, then it can be drawn in this region 
without endangering subsequent steps. If P is drawable in both region s-y and region s 2 , its 
contact vertices are boundary vertices of both s\ and s 2 . At any time, a drawn P (or part of 
it) can be moved from region si to s 2 , or vice versa, simply by reflection with respect to the 
common boundary (and possibly rescaling to fit into the region). □ 

Remark. Nonplanar graphs may be embedded on closed continuous surfaces with holes. For 
instance, a torus is closed surface with exactly one hole. On a torus we can embed the non- 
planar graphs K 5 , Kq and K 7 , and also K 3 3 . K$ is more complex and its embedding requires 
a closed surface with two holes. The smallest number of holes in a closed surface required for 
embedding the graph G on it is called the genus ofG. On the other hand the smallest number of 
crossings of edges in a drawing ofG on plane is called the crossing number of G. Computation 
of genus and crossing number are both MV -complete problems. 

A coloring of a graph is a labeling of vertices where adjacent vertices never share a label. 
The labels are then often called colors. We say that a graph is k-colorable if it can be colored 
using (at most) k colors. If a graph is colorable then it obviously can not have loops. Equally 
obviously, parallel edges can be reduced to one, so we may assume our graphs here to be simple. 
The smallest number k for which the graph G is A;-colorable, is called the chromatic number of 
G, denoted by x(G). 

K 4 is an example of a planar simple graph which is not 3-colorable. On the other hand there 
is the celebrated 

Theorem 6.5. (The Four-Color Theorem) Every simple planar graph is A-colorable. 

Proof. The only known proofs require extensive computer runs. The first such proof was ob- 
tained by Kenneth Appel ja Wolfgang Haken in 1976. It takes a whole book to present the 
proof: Appel, K. & Haken, W.: Every Planar Map is Four Colorable. American Mathemat- 
ical Society (1989). □ 

If we require a bit less, i.e. 5-colorability, then there is much more easily provable result, 
and an algorithm. 

Theorem 6.6. (Hea wood's Theorem or The Five-Color Theorem) Every simple planar graph 
is h-colorable. 

Proof. We may think of G as a planar embedding. We use induction on the number n of vertices 
of G. 

Induction Basis : n—1. Our graph is now 1-colorable since there are no edges. 
Induction Hypothesis: The theorem is true for n <£.(£> 1) 
Induction Statement : The theorem is true for n = i + 1. 

Induction Statement Proof : According to the Minimum Degree Bound, there is a vertex v 
in G of degree at most 5. On the other hand, according to the Induction Hypothesis the graph 
G — v is 5-colorable. If, in this coloring, the vertices adjacent to v are colored using at most 
four colors, then clearly we can 5-color G. 

So we are left with the case where the vertices v 1: v 2: v 3: V4,v 5 adjacent to v axe colored 
using different colors. We may assume that the indexing of the vertices proceeds clockwise, 
and we label the colors with the numbers 1, 2, 3, 4, 5 (in this order). We show that the coloring 
of G — v can be changed so that (at most) four colors suffice for coloring v±, v 2 , v 3 , v 5 . 

We denote by H itj the subgraph of G — v induced by the vertices colored with i and j. We 
have two cases: 
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• vi and v 3 are in different components Hi and H 3 of _£/i i3 . We then interchange the colors 1 
and 3 in the vertices of H 3 leaving the other colors untouched. In the resulting 5-coloring 
of G — v the vertices v 1 and v 3 both have the color 1. We can then give the color 3 to v . 

• vi and t> 3 are connected in H\ :3 . Then there is a v±-vs path in H lj3 . Including the vertex 
v we get from this path a circuit C. Now, since we indexed the vertices v i, v 2 , v 3 , t> 4 , t> 5 
clockwise, exactly one of the vertices v 2 and w 4 is inside C. We deduce that v 2 and v± are 
in different components of if 2 ,4, and we have a case similar to the previous one. □ 

The proof gives a simple (recursive) algorithm for 5-coloring a planar graph, the so-called 
Heawood's Algorithm. 



6.2 The Davidson-Harel Algorithm 

For the actual drawing of a graph we need to define the drawing area (the "window"), i.e. a 
rectangular area with sides parallel to the coordinate axes, the drawing curve of the edges (here 
edges are drawn as line segments), and certain "criteria of beauty", so that the resulting drawing 
is pleasant to the eye, balanced, and as clear as possible. Such "beauty criteria" are of course 
context-dependent and even matters of individual taste. In the sequel we restrict ourselves to 
simple graphs, given by, say, an adjacency matrix or an all-vertex incidence matrix. 

We will now present the so-called Davidson-Harel Algorithm 4 which, applying an anneal- 
ing algorithm, aims at better and better drawings of a graph using a certain ugliness function 
(cf. Section 5.7). An ugliness function R computes a numerical ugliness value obtained from 
a drawing P of a graph G. This value is a sum of various contributing factors. We denote, as 
usual, the sets of vertices and edges of G by {v±, . . . , v n } and {ei, . . . , e m }, respectively. We 
also denote by v« the vector (or geometric point) corresponding to the vertex and by e,- the 
line segment corresponding to the edge e,-. Further, we denote 

dij = || Vi - Vj-||, 

Ti = distance of Vj from the right border of the window, 
li = distance of Vj from the left border of the window, 
Ui = distance of Vj from the upper border of the window, 
bi = distance of Vj from the lower border of the window, 
Cj = length of the line segment e^, 

1 1, if the line segments and e,- intersect without and ej being adjacent 
1 otherwise, 

{distance of V; from the line segment e, if it exceeds 7 and v* is not an end vertex of e, 
7 otherwise. 

7 is a parameter of the algorithm telling how close to vertices edges can be. The ugliness 
function is then given by 



4 The original reference is DAVIDSON, R. & HAREL, D.: Drawing Graphs Nicely Using Simulated Annealing. 
ACM Transactions on Graphics 15 (1996), 301-331. 
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where Ai, . . . , A 5 are nonnegative- valued parameters weighting the contributions of the various 
factors. (One could actually use negative values as well, whatever the interpretation then might 
be.) 

We can compute dij, . . . , gij quite easily using some basic formulae of vector geometry. We 
must, however, think about the speed of the computation as well. One way to speed up the 
computation is to use complex arithmetic, dij,..., g, L j are then equally easily computable. 5 It 
may also be of advantage to force the vertices into a lattice of geometric points. This can be 
achieved for instance by rounding the coordinates (or complex numbers) to a fixed accuracy 
and abandoning drawings where the ugliness function has the value oo (this happens e.g. when 
vertices occupy the same point). 

In the annealing process the state is P and the response is R{P). An initial state can be 
obtained by choosing the points Vi, . . . , v„ in the window randomly, and then drawing the 
edges accordingly. The state transition process P A p (P) is the following: 

• Choose a random vertex Vj. (Alternatively the vertices may be circulated cyclically.) 

• Draw a circle of radius p centered on Vj. The radius p is a parameter, which is initially 
large and gradually reduced later in some systematic fashion. 

• Chose a random point u on this circle. 

• If u is outside the drawing window, the state remains the same. Otherwise set Vj u 
and change the edges accordingly in the drawing. 

The remaing parts of the algorithm are very similar to the annealing algorithm for the TSP in 
Section 5.7. 

Remark. This method has numerous variants. The window could be a circle and the edges 
concentric arcs or radii. Or the window could be a sphere and edges drawn as arcs of great 
circles. The window could also be unbounded, for instance, the whole ofM. 2 . We could "draw" 
graphs three-dimensionally. Etc. We could also use a metric other than the Euclidean one when 
computing distances, e.g. the Manhattan metric ("1-norm") or the max-metric ("oo-norm"), 
geodetic distances on a sphere, etc. Needless to say, the resulting drawings are rather different 
using these variants of the algorithm. 

It may be noted that using nearly any effective criteria, finding the optimally pleasing draw- 
ing of a simple graph is an MV-hard problem. 



5 Note that if z\ = x\ + jyi and z 2 = x 2 + jj/2, where j is the imaginary unit, then the real part of z\z 2 equals 



the dot product (xi, y\) • (x2, 2/2) and the imaginary part equals the determinant 



X\ X2 

yi V2 



Chapter 7 
MATROIDS 



Many concepts in the preceding chapters do not so much deal with graphs themselves as their 
structural properties. Examples are various dualities (cut set vs. circuit), principles behind cer- 
tain algorithms (e.g. Kruskal's Algorithms), and various extremality properties (many structures 
are the "smallest of their kind", one cannot e.g. remove an edge of a cut set without it losing 
this property). 

Exactly corresponding structures were found in many other areas of mathematics, and they 
were called matroids. 1 

7.1 Hereditary Systems 

A hereditary family of sets is a family of sets such that whenever a set F is in the family then 
so are all subsets of F (and in particular the empty set 0). A hereditary system M of a set E 
is a nonempty hereditary family of subsets of E. Included there are also the various ways of 
specifying X M , called aspects. It will be assumed in what follows that E is a finite set. The 
following nomenclature is traditional: 

• Sets in the family X M are called independent sets of M. 

• The family of subsets of E other than those in X M is denoted by V M and called the family 
of dependent sets of M. 

• An independent set is maximal if it is not a proper subset of another independent set. A 
maximal independent set is called a basis. The family of all bases is denoted by Bm- Note 
that an independent set is always contained in a basis. 

• A dependent set is minimal if no dependent set is its proper subset. A minimal dependent 
set is called a circuit. 2 (Recall that the empty set is always in Zm-) The family of all 
circuits is denoted by Cm- Note that a dependent set always contains a circuit. 

• A circuit consisting of only one element is a so-called loop. Elements of a circuit with 
two elements are called parallel. A hereditary system is simple if it has no loops and no 
parallel elements. 

'The remarkable thing is that many of these structures were found independently at the same time around the 
year 1935: Hassler Whitney investigated planarity of graphs, Saunders MacLane geometric lattices of points, and 
Bartel van der Waerden's topic was independence in vector spaces. 

2 This or any other "familiar sounding" concept should not be confused with the corresponding concept for 
graphs, even though there is a certain connection, as will be seen! 
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• The rank of a subset F of E is the largest size of an independent set contained in F. 
(Recall that E is assumed to be finite.) Note that the empty set is always an independent 
set contained in F. The rank of F is denoted by p M (F), and p M is called the rank function 
of M. 

A notation similar to one used for graphs will be adopted in the sequel concerning adding 
an element e to the set F (denoted by F + e) or removing it from F (denoted by F — e). Two 
easy properties of the rank function are the following 

Theorem 7.1. If M is a hereditary system of the set E then 

(i) /0m (0) = 0, and 

(ii) for any subset F of E and any element e, 

Pm(F) < PM {F + e) < p M (F) + 1. 

Proof. Item (i) is clear, so let us move to item (ii). 

Since F + e contains those independent sets that are contained in F, we have Pm{F + e) > 
Pm(F). On the other hand, possible independent subsets of F + e not contained in F may only 
consist of an independent subset of F and e, so p M (F + e) < Pm(F) + 1. □ 

A hereditary system M may of course be specified by giving its independent sets, that is by 
giving X M - It can be specified as well by giving its bases, i.e. Bm, independent sets will then 
be exactly all subsets of bases. On the other hand, M can be specified by giving its circuits, 
i.e. Cm, independent sets are then the sets not containg circuits. Finally, M can be defined by 
giving the rank function p M , since a set F is independent exactly when Pm{F) = #(F). (As 
before, we denote cardinality of a set F by Thus an aspect may involve any of X M , B M , 

C M and p M - 

It might be mentioned that a hereditary system is a far too general concept to be of much 
use. This means that well chosen aspects are needed to restrict the concept to a more useful one 
(that is, a matroid). Let us have a look at certain proper aspects in connection with a matroid 
well familiar from the preceding chapters. 

7.2 The Circuit Matroid of a Graph 

The circuit matroid M(G) of a graph G = (V, E) is a hereditary system of the edge set E 
whose circuits are the circuits of G, considered as edge sets. (It is naturally assumed that G is 
not empty.) The bases of M(G) are the maximal independent edge sets, i.e. spanning forests 
of G, and the independent sets of M(G) are the subforests, both considered as edge sets. Let 
us denote Gf = (V, F) for a subset F of E. The number of vertices of G is denoted by n, as 
usual. 

Remark. A hereditary system that is not directly a circuit matroid of any graph but has a 
structure identical to one is called a graphic matroid. 

Let us then take a look at different aspects of the circuit matroid. 
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Basis Exchange Property 

Let us consider two bases (i.e. spanning forests) B\ and B 2 . If e is an edge in B%, its removal 
divides some component G' of the graph G into two disjoint subgraphs. Now certain edges of 
B\ will be the branches of a spanning tree T\ of G', and similarly, certain edges in B 2 will be 
the branches of a spanning tree T 2 of G'. The removed edge e is either a branch of T 2 or then a 
link of T 2 *. In the latter case e will be in the fundamental cut set determined by a branch / of T 2 
(cf. Theorem 2.7). Then T\ — e + / is also a spanning tree of G' and we can replace e by / and 
get again a spanning forest of G, that is, a basis. 

Hence we have 

Basis Exchange Property: lfB\ and B 2 are different bases and e G B\ — B 2 then there is an 
element f 6 B 2 — B\ such that B\ — e + / is a basis. 

In general, a hereditary system with the basis exchange property will be a matroid. In other 
words, the basis exchange property is a proper aspect. Using basis exchange one can move 
from one basis to another. All bases are thus of the same size. 

Uniformity. Absorptivity 

For a subset F of E let us denote by rip the number of vertices in the subgraph (F) of G 
induced by F, and by kp the number of its components. Then there are np — kp edges in a 
spanning forest of (F) . Let us denote further by Kp the number of components of the subgraph 
G F of G. Clearly then 

Pm{G){F) = n F - k F = n - K F , 
and all such spanning forests are of the same size. Hence 

Uniformity: For a subset F of E all maximal independent subsets of F are of the same size. 
(Maximality of a set means here that there are no independent sets J such that He J C F.) 

In general, a hereditary system with the uniformity property will be a matroid, and uniformity 
is a proper aspect. 

In the figure below continuous lines are the edges of F, with the thick ones being the 
branches of a spanning forest. Dashed lines indicate the remaining edges in E. 




If e is an edge of G and Pm(g)(F + e) = Pm(G)(F) men e does not connect two components 
of G F . Suppose / is another edge with the same property, that is, Pm(g)(F + /) = Pm{G){F). 
Clearly then 

Pm(g)(F + e + f) = p M (G)(F). 

Thus we get 

Weak Absorptivity: If e, f e E and F C E and 



Pm(F) = p M (F + e)= p M (F + f) 
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then also 

p M {F + e + /) = pm(F). 

In general a weakly absorptive hereditary system is a matroid, and thus weak absorptivity is 
another proper aspect. 

By repeating the above argument sufficiently many times we see that if F and F' are sets of 
edges of G, and for each edge e in F' we have Pm(g) (F + e) = Pm(g) (F), then 

p M (G)(FUF')= p M(G) (F). 

Hence also 

Strong Absorptivity: If F, F' C E and pm(F + e) = pu{F)for each element e in F' then 

Pm(FUF') = p M (F). 

We conclude that strong absorptivity is a proper aspect. 

Augmentation 

Suppose Ii and I 2 are independent sets of the circuit matroid M(G) (edge sets of subforests 
of G) and #(ii) < #(^2)- The subgraph then has n — components, and the subgraph 
G/ 2 has n — #(^2) components, so strictly less than G^. Adding an edge does not reduce the 
number of components exactly in the case where the edge is added in some component. Thus, 
if adding any edge in I 2 — h to G/ 1 preserves the number of components then it must be that 
the edge is added in some component of G/ 1 , and Gj 2 cannot have fewer components than G/ 1 . 
But as noted, this is not the case if #(ii) < #(h), and so 

Augmentation: If Ii and I 2 are independent sets of the hereditary system M and #(ii) < 
#(/ 2 ) then there exists an element e e I 2 - h such that I± + e is in X M . 

In general, a hereditary system with the augmentation property is a matroid. Thus augmentation 
is a proper aspect, too. 

Elimination 

The circuits of the circuit matroid M(G) are the edge sets of the circuits of G. The degree 
of a vertex in a circuit is two. If C\ and C 2 are different circuits of M(G) then the degree of 
a vertex of the ring sum (Ci) © (C 2 ) is also even, see Section 1.3. Hence (Ci) © (C 2 ) must 
contain at least one circuit as a subgraph, since a ring sum does not have isolated vertices and 
a nonempty forest has at least one pending vertex (Theorem 2.3). Recalling the definition of 
ring sum in Section 1.3 it is noticed that such a circuit does not contain edges in the intersection 
C\ fl C 2 , at least not with as high multiplicity as in C\ U C 2 . Thus 

Elimination Property: If C\ and C 2 are different circuits of the hereditary system M and 
e G C\ fl C 2 then there is a circuit C G Cm such that C C Ci U C 2 — e. 

Again, elimination property is a proper aspect, and a hereditary system with the elimination 
property is a matroid. 
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Induced Circuits 

If I is an independent set of the circuit matroid M(G) (edge set of a subforest) then adding 
one edge either closes exactly one circuit in a component of Gj (Theorem 2.3), or then it con- 
nects two components of Gj and does not create a circuit. We have then 

Property of Induced Circuits: If I is an independent set of a hereditary system M and e G E 
then I + e contains at most one circuit. 

The property of induced circuits is a proper aspect, and a hereditary system having this property 
will be a matroid. 

7.3 Other Basic Matroids 

Vectorial Matroid 

Let E be a finite set of vectors of a vector space (say R n ) and the independent sets of a 
hereditary system M of E be exactly all linearly independent subsets of E (including the empty 
set). M is then a so-called vectorial matroid. Here E is usually allowed to be a multiset, i.e. 
its elements have multiplicities — cf. parallel edges of graphs. It is then agreed that a subset of 
E is linearly dependent when one its elements has a multiplicity higher than one. A hereditary 
system that is not directly vectorial but is structurally identical to a vectorial matroid M' is 
called a linear matroid, and the matroid M' is called its representation. 

A circuit of a vectorial matroid is a linearly dependent set C of vectors such that remov- 
ing any of its elements leaves a linearly independent set — keeping in mind possible multi- 
ple elements. An aspect typical to vectorial matroids is the elimination property. If C\ = 
{r, ri, . . . , rfc} and C2 = {r, r[, . . . , r^} are different circuits sharing (at least) the vector r then 
r can be represented as linear combinations of other vectors in both C\ and C 2 , and in such a 
way that all coefficients in the combinations are nonzero. We get thus an equality 

k I 

i=i 3=1 

Combining (possible) repetitive vectors on the left hand side, and noticing that this does not 
make it empty, we see that C\ U C 2 — r contains a circuit. (Note especially the case where either 
C 1 = {r,r}orC 2 = {r,r}.) 

In the special case where E consists of columns (or rows) of a matrix A, a vectorial matroid 
of E is called a matrix matroid and denoted by M(A). For example, the circuit matroid M(G) 
of a graph G is a linear matroid whose representation is obtained using the rows of the circuit 
matrix of G in the binary field GF(2) (see Section 4. 5). 3 Of course, if desired, any vectorial 
matroid of E may be considered as a matrix matroid simply by taking the vectors of E as 
columns (or rows) of a matrix. 4 



3 Hereditary systems with a representation in the binary field GF(2) are called binary matroids. The circuit 
matroid of a graph is thus always binary. 

4 This actually is the origin of the name "matroid". A matroid is a generalization of a linear matroid and a linear 
matroid may be thought of as a matrix. Indeed, not all matroids are linear. The name "matroid" was strongly 
opposed at one time. Even today there are people who prefer to use names like "geometry" or "combinatorial 
geometry". 
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Transversal Matroid 

Let A = {Ai, . . . , Ak] be a family of nonempty finite sets. The transversal matroid M(A) 
is a hereditary system of the set E = A x U • ■ • U A k whose independent sets are exactly all 
subsets of E containing at most one element of each of the sets A\ (including the empty set). 
Here it is customary to allow the family A to be a multiset, that is, a set Ai may appear several 
times as its element, thus allowing more than one element of Ai in an independent set. 

A natural aspect of transversal matroids is augmentation, and it is connected with aug- 
mentings of matchings of bipartite graphs! (See Section 5.8.) Let us define a bipartite graph 
G = (V, E) as follows: The vertex set is V = E U A, and the vertices e and Aj are connected 
by an edge exactly when e G Aj. (Note how the vertex set V is naturally divided into the two 
parts of the cut, E and A.) An independent set of M(A) is then a set of matched vertices of G 
in E, and vice versa. 

Example. In the figure below is the bipartite graph corresponding to the transversal matroid 
of the family {{1, 2}, {2, 3, 4}, {4, 5}}, and its independent set {1, 2, 4} (thick line). 




Very much in the same way as in the proof of Theorem 5.3 one may show that if l\ and I 2 are 
independent sets (vertex sets of the matchings Si and S2) and #(ii) < #(^2) then there is an 
augmenting path of the matching Si such that the new matched vertex is in I 2 . Thus M(A) 
indeed has the augmentation property. 

Remark. For matchings of bipartite graphs the situation is completely general. That is, match- 
ings of bipartite graphs can always be thought of as independent sets of transversal matroids. 
In fact this remains true for matchings of general graphs, too, leading to the so-called matching 
matroids, see e.g. Swamy & Thulasiraman. 

If the sets of the family A are disjoint — i.e. they form a partition of E — then the transversal 
matroid is also called partition matroid. For a partition matroid augmentation is obvious. 

Uniform Matroid 

For all finite sets E one can define the so-called uniform matroids. The uniform matroid of E 
of rank k, denoted Uk(E), is a hereditary system whose independent sets are exactly all subsets 
of E containing at most k elements. The bases of Uk(E) are those subsets containing exactly 
k elements, and the circuits are the subsets containing exactly k + 1 elements. In particular, all 
subsets of E form a uniform matroid of E of rank #(E), this is often called the free matroid of 
E. Quite obviously Uk(E) has the basis exchange property and the augmentation property. 

Uniform matroids are not very interesting as such. They can be used as "building blocks" 
of much more complicated matroids, however. It may also be noted that uniform matroids are 
transversal matroids (can you see why?). 
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7.4 Greedy Algorithm 

Many problems of combinatorial optimization 5 may be thought of as finding a heaviest or a 
lightest independent set of a hereditary system M of E, when each element of E is given a 
weight. The weighting function is a : E — > R and the weight of a set F C £ is 

^a(e). 

The two optimization modes are interchanged when the signs of the weights are reversed. 

One may also find the heaviest or the lightest bases. Again reversing the signs of the weights 
interchanges maximization and minimization. If all bases are of the same size — as will be the 
case for matroids — they can be restricted to the case where there weights are positive. Indeed, 
if A is the smallest weight of an element of E then changing the weight function to 

f3 : /3( e ) = 1 + a(e) - A 

one gets an equivalent optimization problem with positive weights. On the other hand, maxi- 
mization and minimization are interchanged when the weighting function is changed to 

p : /3(e) = 1 + B- a(e) 

where B is the largest weight of an element of E. 

Example. (A bit generalized) Kruskal's Algorithm (see Section 5.6) finds a lightest spanning 
forest of an edge-weighted graph G, i.e. a lightest basis of the circuit matroid of G. As was seen, 
this can be done quite fast — and even faster if the edges are given in the order of increasing 
weight when one can always consider the "best" remaining edge to be included in the forest. 
Kruskal's Algorithm No. 1 is an example of a so-called greedy algorithm that always proceeds 
in the "best" available direction. Such a greedy algorithm is fast, indeed, it only needs to find 
this "best" element to be added in the set already constructed. 

It might be mentioned that Kruskal's Algorithm No. 3 is also a greedy algorithm, it finds a 
heaviest cospanning forest in the dual matroid of the circuit matroid, the so-called bond matroid 
ofG (see Section 7.6). 

Even though greedy algorithms produce the correct result for circuit matroids they do not 
always do so. 

Example. Finding a lightest Hamiltonian circuit of an edge-weighted graph G may also be 
thought of as finding the lightest basis of a hereditary system — assuming of course that there 
are Hamitonian circuits. The set E is again taken to be the edge set of G but now the bases 
are the Hamiltonian circuits of G (considered as edge sets). A lightest basis is then a lightest 
Hamiltonian circuit. As was noticed in Section 5. 7, finding a lightest Hamiltonian circuit is a 
well-known MV -complete problem and no greedy algorithm can thus always produce a ( cor- 
rect) result — at least ifV ^ AfV. The hereditary system thus obtained is in general a matroid, 
however (e.g. it does not generally have the basis exchange property). 

It would thus appear that — at least for matroids — greedy algorithms are favorable methods 
for finding heaviest/lightest bases (or independent sets). Indeed, matroids are precisely those 
hereditary systems for which this holds true. To be able to proceed further we define the greedy 
algorithm formally. We consider first maximization of independent sets, minimization is given 
in brackets. The input is a hereditary system M of the set E, and a weighting function a. 

5 These problems are dealt with more extensively in the course Optimization Theory 2. 
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Greedy Algorithm for Independent Sets: 

1. Sort the elements e 1: . . . , e m of E according to decreasing [increasing] weight: e(i), . . . , 

C(m) • 

2. SetF^0andA;^ 1. 

3. If a(e(fc)) < [a(e(fc)) > 0], return F and quit. 

4. If a(e(fc)) > [a(e(fc)) < 0] and F U {e( fe )} is independent, set F ^— F U {e^)}. 

5. If fc = m, return F and quit. Else set k <— fc + 1 and go to #3. 
For bases the algorithm is even simpler: 

Greedy Algorithm for Bases: 

1. Sort the elements ei, . . . , e m of E according to decreasing [increasing] weight: e^), . . . , 
e( m ). 

2. SetF^0andA;^ 1. 

3. If F U {e( fc )} is independent, set F <— F U {e^)}- 

4. If fc = m, return F and quit. Else set k <— fc + 1 and go to #3. 

The main result that links working of greedy algorithms and matroids is 

Theorem 7.2. (Matroid Greediness Theorem) The greedy algorithm produces a correct heav- 
iest independent set of a hereditary system for all weight functions if and only if the system is 
a matroid. (This is the so-called greediness property.) The corresponding result holds true for 
bases, and also for finding lightest independent sets and bases. Furthermore, in both cases it 
suffices to consider positive weights. 

Proof. The first sentence of the theorem is proved as part of the proof of Theorem 7.3 in the 
next section. 

As noted above, greediness is equivalent for maximization and minimization, for both inde- 
pendent sets and bases. It was also noted that finding a heaviest basis may be restricted to the 
case of positive weights. Since for positive weights a heaviest independent set is automatically 
a basis, greediness for bases follows from greediness for independent sets. 

On the other hand, if greediness holds for bases, it holds for independent sets as well. Max- 
imization of independent sets using the weight function a then corresponds to maximization of 
bases for the positive weight function 

P : (5(e) = l + max(0,a(e)), 

the greedy algorithms behave exactly similarly, item #3 is not activated for independent sets. 
Elements of weight 1 should be removed from the output. □ 

Remark. Greediness is thus also a proper aspect for matroids. For hereditary families of sets 
it is equivalent to usefulness of the greedy algorithm. Certain other similar but more general 
families of sets have their own "greediness theorems". Examples are the so-called greedoids 
and matroid embeddings. 
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7.5 The General Matroid 

Any one of the several aspects above makes a hereditary system a matroid. After proving that 
they are all equivalent, we may define a matroid as a hereditary system that has (any) one of 
these aspects. 

Before that we add one aspect to the list, which is a bit more difficult to prove directly for 
circuits matroids of graphs: 

Submodularity: If M is a hereditary system of the set E and F, F' C E then 
p M (F n F') + p M {F U F') < p M {F) + Pm(F'). 



Let us then prove the equivalences, including submodularity. 

Theorem 7.3. If a hereditary system has (any) one of the nine aspects below then it has them 
all (and is a matroid). 

(i) Uniformity (vi) Submodularity 

(ii) Basis exchange property (vii) Elimination property 

(iii) Augmentation property (viii) Property of induced circuits 

(iv) Weak absorptivity (ix) Greediness 

(v) Strong absorptivity 

Proof. The implications are proved following the strongly connected digraph below: 



r 

(i)- 
T 

(vi) 

I 

(vii) 



(ii) 



(iii) 



(iv) 



'(v) 



■ (viii) ► (ix) 



All nine aspects are then connected by implication chains in both directions, and are thus logi- 
cally equivalent. Let us consider a general hereditary system M of the set E. 

(i) =Kii): As a consequence of uniformity, all bases of M are of the same size. If E>i, B 2 G 
Bm and e 6 B\ — B 2 , we may apply uniformity to the set F = (B\ — e) U B 2 . All maximal 
independent sets included in F are then of the same size as B 2 (and Bi). Now B\ — e is not 
one of these maximal sets having too few elements. On the other hand, by adding one element 
/ to Bi — e we get such an independent set H. The element / must then be in the set difference 
B 2 — B\, so H = B 1 — e + f. Moreover, H has as many elements as B\, and so it is a basis. 

(ii) =Kiii): If h,I 2 G X M and #(ii) < #(-£2), we choose bases B\ and B 2 such that I\ C B\ 
and I 2 C B 2 . Applying basis exchange (repeatedly) we replace those elements of B\ — I\ that 
are not in B 2 by elements of B 2 . After this operation we may assume that B\ — I\ C B 2 . As a 
consequence of the basis exchange property all bases are of the same size. Thus 

#(£?! - h) = #(50 - #(70 > #(S a ) - #(/ 2 ) = #(5 2 - J 2 ), 

and i?i — Ii cannot be included in B 2 — I 2 . Therefore there is an element e of B\ — I\ in I 2 and 
I\ + e is an independent set. 
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(iii) =Kiv): Let us consider a situation where 

Pm(F) = p M {F + e)= p M {F + f). 

If now pm(F + e + f) > pm(F), we take a maximal independent subset I\ of F and a maximal 
inpendent subset I 2 of F + e + f. Then #(/ 2 ) > and by the augmentation property I\ 

can be augmented by an element of I 2 . This element cannot be in F (why not?), so it must be 
either e or /. But then p M {F) < p M (F + e) or p M (F) < p M (F + f) <y). 

(iv) =Kv): Let us assume weak absorptivity and consider subsets F and F' of E such that 
Pm(F + e) = Pm(F) for each element e of F'. We use induction on k = #(F' — F) and show 
that Pm(F) = Pm(F U F') (strong absorptivity). 

Induction Basis : Now k = or k = 1 and the matter is clear. 
Induction Hypothesis: The claimed result holds true when k <£.(£> 1) 
Induction Statement : The claimed result holds true when k = £ + 1. 
Induction Statement Proof : Choose distinct elements e, f E F' — F and denote F" = F' — 
e — /. The Induction Hypothesis implies that 

Pm(F) = p M (F U F") = p M (F U F" + e) = p M (F U F" + /). 

Applying weak absorptivity to this it is seen that 

Pm{F) = p M {F UF" + e + f)= p M (F U F'). 

(v) =>(i): If I is a maximal independent subset of F then pM^+e) = pu (-0 for elements e in 
the set difference F — I (if any). Strong absorptivity implies then that Pm{F) = Pm(I) = #(-0> 
i.e. all these independent sets are of the same size and uniformity holds true. 

(i)=Kvi): Let us consider sets F, F' C E and denote by ii a maximal independent subset of 
the intersection F D F' and by I 2 a maximal inpendent subset of the union F U F' . Uniformity 
implies augmentation, so we may assume that I 2 is obtained from I\ by adding elements, that 
is Ji C I 2 . Now I 2 fl F is an independent subset of F and I 2 fl F' is an independent subset of 
F', and both of them include fx. So 

Pm (f n F') + pmC^ u f') = #(Ji) + #(/ 2 ) 

= #(/ 2 n F) + #(/ 2 n F') < PA/ (F) + PM (F'). 

The equality marked by an asterisk is a set-theoretical one, see the figure below. 




(vi)=Kvii):Let us consider distinct circuits Ci, C 2 G Cm and an element e G Ci fl C 2 . Then 
Pm(Ci) = #(d) - 1 and p M (C 2 ) = #(C 2 ) - 1, and ^(d n C 2 ) = #(d n C 2 ). (Remember 
that every proper subset of a circuit is independent.) If now C\ UC 2 — e does not contain a circuit, 
it is independent and p M (C 1 UC 2 -e) = #(CiUC 2 )-l, whence p M (CiUC 2 ) > #(CiUC 2 )-l. 
Submodularity however implies that 



p M (Ci n C 2 ) + p M (Ci U C 2 ) < p M {Ci) + Pm(C 2 ), 
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and further that (check!) 

#(Ci n c 2 ) + #(d u c 2 ) < #(Ci) + #(c 2 ) - 1. 

This is a set-theoretical impossibility, and thus Ci U C 2 — e does contain a circuit. 

(vii) =Kviii): If J is an independent set and I + e contains two distinct circuits C\ and C 2 
then obviously both C\ and C 2 contain the element e. The elimination property implies that 
C\ U C 2 — e contains a circuit. Since Ci U C 2 — e is however contained in I, it is independent 
(V). So J + e contains at most one circuit. 

(viii) =Kix): Let us denote by I the output of the greedy algorithm for the weighting function 
a. (The problem is finding a heaviest independent set.) If I is a heaviest independent set, then 
the matter is clear. Otherwise we take a heaviest independent set having the largest intersection 
with I. Let us denote this heaviest independent set by /'. I cannot be a subset of /', because the 
greedy algorithm would then find an even heavier independent set. Let us further denote by e the 
first element of the set difference I — I' that the greedy algorithm chooses. I' + e is a dependent 
set and contains thus exactly one circuit C (remember the property of induced circuits). This 
circuit of course is not included in /, so there is an element / e C — I. Since I' + e contains 
only one circuit, /' + e — / is an independent set. /' is maximal, so that ct(f) > a(e). On the 
other hand, / and those elements of I that the greedy algorithm chose before choosing e are all 
in whence adding / to the elements does not create a circuit. This means that / was available 
for the greedy algorithm when it chose e, and so ct(f) < a(e). We conclude that ct(f) = ct(e) 
and the sets I' + e — f and /' have equal weight. This however is contrary to the choice of I' 
because #((/' + e — /) D /) > #(/' D /). (The reader may notice a similarity to the proof of 
Theorem 5.2. Indeed, this gives another proof for Kruskal's Algorithm No. 1.) 

(ix) =Kiii): Let us consider independent sets 7i and I 2 such that < For brevity 
we denote k — Consider then the weighting function 

{k + 2, if e G h 
k + 1, if e e h — h 
otherwise. 

The weight of I 2 is then 

«(e) >{k + l) 2 > k{k + 2) = J2 «( e )- 

e£l 2 eei"i 

It is thus larger than the weight of I±, so 1\ is not a heaviest independent set. On the other 
hand, when finding a heaviest independent set the greedy algorithm will choose all elements of 
I\ before it ever chooses an element of I 2 — h- Since it is now assumed to produce a heaviest 
independent set, it must choose at least one element e of h — h and I±+e is thus an independent 
set. This shows that the augmentation property holds true. □ 

The most popular aspect defining a matroid is probably the augmentation property. 

7.6 Operations on Matroids 

In the preceding chapters, in connection with fundamental cut sets and fundamental circuits, 
mutual duality was mentioned. Duality is a property that is very natural for hereditary systems 
and matroids. 
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The dual (system) M* of a hereditary system M of the set E is a hereditary system of E 
whose bases are the complements of the bases of M (against E). Often the bases of M* are 
called cobases of M, circuits of M* are called cocircuits of M, and so on. It is easily checked 
that M* really is a hereditary system of E: If B 1 and B 2 are distinct bases of M* then B\ and 
B 2 are distinct bases of M. Thus, if Si C 7J 2 then B 2 C Bi (V). Note also that (M*)* = M. 

Theorem 7.4. (Whitney's Theorem) The dual M* of a matroid M is a matroid, the so-called 
dual matroid, and 

Pm*(F) = #(F)-p M (E)+p M (F). 
(Note that pm{E) is the size of a basis of M.) 

Proof Let us show that M* has the basis exchange property, which makes it a matroid accord- 
ing to Theorem 7.3. If B 1 and B 2 are distinct bases of M* and e G B\ — B 2 then B\ and B 2 
are distinct bases of M and e G B 2 — B\. Since Si is a basis of M, B\ + e contains exactly 
one circuit C of M (the property of induced circuits) and this circuit must have an element 
/ G B 2 — Bi. Then however Bi + e — f does not contain a circuit of M, i.e. it is an independent 
set of M, and has the same size as B 1 . All bases have the same size, so B 1 + e — / is a basis of 
M and its complement B\ — e + / is a basis of M*. 

To compute the rank pm*(F) we take a maximal independent set if of M* included in F. 
Then 

PM*(F) = p M *(H) = #(H). 

Then is a minimal set containing the set F and a basis of M. (This is simply the same 
statement in other words. Note that H is included in some basis of M* .) But such a set is 
obtained starting from F, taking a maximal independent set of M contained in F — which has 
p M {F) elements — and extending it to a basis — which has Pm(E) elements. So 

#(17) = p M (E) - p M (F). 

Set theory tells us that 

#(77) + #(#) = #(£) = #(F) + #(F). 
Combining these we get the claimed formula for p M * (F) (check!). □ 

Dualism gives a connection between bases of a matroid M and circuits of its dual matroid 
M* (i.e. cocircuits of M): 

Theorem 7.5. (i) Circuits of the dual matroid of a matroid M are the minimal sets that intersect 
every basis of M. 

(ii) Bases of a matroid M are the minimal sets that intersect every circuit of the dual matroid 
M*. 

Proof (i) The circuits of M* are the minimal sets that are not contained in any complement of 
a basis of M. Thus they must intersect every basis of M. 

(ii) Bases of M* are the maximal sets that do not contain any circuit of M*. The same in 
other words: Bases of M are the minimal sets that intersect every circuit of M*. □ 

Example. Bases of the circuit matroid M(G) of a connected graph G are the spanning trees. 
Bases of the dual matroid M* (G) are the complements of these, i.e. the cospanning trees. By the 
theorem, circuits of the dual matroid are the cut sets ofG. (Cf. Theorems 2.4 and 2.5.) Because 
according to Whitney's Theorem M*(G) is a matroid, it has the greediness property, that is, the 
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greedy algorithm finds a heaviest/lightest basis. Working ofKruskal's Algorithm No. 3 is based 
on this. The algorithm finds the heaviest cospanning tree. 

Analogous concepts can naturally be defined for a general, possibly disconnected, graph G. 
Bases of M*(G) are then the cospanning forests of G. The dual matroid M*(G) is called the 
bond matroid or the cut matroid or the cocircuit matroid of G. So, when is the bond matroid 
M*(G) graphic, i.e. the circuit matroid of a graph? The so-called Whitney Planarity Theorem 
tells us that this happens exactly when G is a planar graph! (See e.g. West. J 

If Mi is a hereditary system of the set E,- L for % = 1, . . . , k then the direct sum M = Mi® 
• • • © M k of the systems Mi, ... , M k is the hereditary system of the set E — Ei U • • • U E k whose 
independent sets are exactly all sets h U • • • U I k where Li E X Mi (i — 1, • • • , k). In particular, 
if Ei = ■ ■ ■ = E k = E then the direct sum M is called the union of the systems Mi, . . . , Mk, 
denoted by M = Mi U ■ • • U M k . Note that each hereditary system M; could also be thought of 
as a hereditary system of the set E simply by adding elements of E — E,- L as circuits (loops, that 
is). ' " ~ 

It is not exactly difficult to see that if Mi , . . . , Mk are matroids and the sets E 1 , . . . , E k are 
pairwise disjoint then M = Mi © • • • © Mk is a matroid, say, by demonstrating the augmentation 
property (try it!). But actually a more general result holds true: 

Theorem 7.6. (Matroid Union Theorem 6 ) If Mi ... , M k are matroids of the set E then the 
union M = Mi U • • • U M k is also a matroid of E and 

k 

PM : Pm(F) = min (#(F - F') + ^p Mi (F')). 

i=i 

Proof. The proof is rather long and difficult, and is not given here (see e.g. West or OxLEY.) 
It might be mentioned, though, that the rank formula is not valid for hereditary systems in 
general. □ 

The theorem has many fundamental corollaries, e.g. 

Corollary. (Matroid Covering Theorem 7 ) If M is a loopless matroid of the set E then the 
smallest number of independent sets whose union equals E is 

jjFV 
Pm(F) 

Proof. Note first that since M is loopless, each element of E is in itself an independent set. The 
set E thus can be covered as stated. Take now k copies of M as the matroids Mi, ... , M k in the 
union theorem. Then E is a union of k independent sets of M exactly when it is an independent 
set of the union matroid M' = Mi U • • • U Mk. The covering property we are interested in can 
then be expressed in the form p M >(E) = #(£') or, by the union theorem, 

k 

4{E) = min (#(£ - F) + ^Pm.(F)) 

i=i 

i.e. 

min(^ M (F) - #(F)) = 0. 

Since the difference to be minimized is = when F is the empty set, k will be the smallest 
number such that k > #(F)/p M (F) for all nonempty subsets F C E. □ 

6 Also known by the names Edmonds-Fulkerson Theorem and Matroid Sum Theorem. 
7 Also known as Edmonds' Covering Theorem. 
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Example. For the circuit matroid M(G) of a loopless graph G independent sets are the sub- 
forests of G, and we are interested in the minimum number of subforests needed to contain all 
edges of G. Let us denote this number by A(G), it is called the arboricity of G. 

To analyze the maximization in the covering theorem we divide the subgraph (F) induced 
by the edges in F into its components. Numbers of vertices and edges of these components are 
denoted by rii, . . . , n kp and mi, ... , m kp , respectively. We use an indexing such that 



n kp - 1 rik F -i ~ l n x 
Now, in general if ' — > — then — > Xl ~*~ X<2 . Thus 

V2 yi V2 yi + y2 

m 2 mi + m 2 



n 2 -l ni + n 2 -2' 
and continuing inductively, also 



m mH Ymi 

> ; ; : (i = 1, . . . ,k F ). 



In particular then 



rii-1 ni H h rii 



m kF mH Vm kp _ #(F) 



n kF -l ni H Vn kp -k F p M (G)(F) ' 

Maximization can thus be restricted to edge-sets F such that (F) is connected and Pm(g) (F) = 
np — 1 where rip is the number of vertices of F. (It might be further restricted to edge-sets F 
such that (F) also equals the subgraph induced by its vertices, since connecting two vertices by 
an edge increases the numerator of the fraction to be maximized, the denominator remaining 
the same.) Thus we get the celebrated Nash-Williams Formula for arboricity: 



A(G) = max 



FCE 



jjF) 
rip — 1 



It might be noted that since for a simple planar graph < "irip — 6 {Linear Bound 

applied to (F) ), A(G) is then at most 3. 

The restriction of a hereditary system M of the set E into the set F C E is a hereditary 
system M\F whose independent sets are exactly those subsets of F that are independent sets 
of M. The contraction of M into the set F is the hereditary system (M*\F)*, often denoted by 
M.F. Clearly the augmentation property of M is directly transferred to M\F, so (cf. Whitney's 
Theorem) 

Theorem 7.7. If M is a matroid of the set E and FCE then M\F and M.F are bot matroids, 
too. 



The minors of a matroid M are all those matroids that can be obtained from M by consecutive 
restrictions and contractions. 
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